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Treating the b quark distribution function with reliable uncertainties 



00 
O 

o 

o 

CD 

Q 



Zoltan Ligeti, 1 Iain W. Stewart, 2 and Frank J. Tackmann 1 

1 Ernest Orlando Lawrence Berkeley National Laboratory, University of California, Berkeley, CA 94720 
2 Center for Theoretical Physics, Massachusetts Institute of Technology, Cambridge, MA 02139 

The parton distribution function for a b quark in the B meson (called the shape function) plays an 
important role in the analysis of the B — > X s 7 and B — > X u lv data, and gives one of the dominant 
uncertainties in the determination of \V u b\- We introduce a new framework to treat the shape 
function, which consistently incorporates its renormalization group evolution and all constraints on 
its shape and moments in any short distance mass scheme. At the same time it allows a reliable 
treatment of the uncertainties. We develop an expansion in a suitable complete set of orthonormal 
basis functions, which provides a procedure for systematically controlling the uncertainties due to 
the unknown functional form of the shape function. This is a significant improvement over fits to 
model functions. Given any model for the shape function, our construction gives an orthonormal 
basis in which the model occurs as the first term, and corrections to it can be studied. We introduce 
a new short distance scheme, the "invisible scheme", for the kinetic energy matrix element, Ai. We 
obtain closed form results for the differential rates that incorporate perturbative corrections and 
a summation of logarithms at any order in perturbation theory, and present results using known 
next-to-next-to-leading order expressions. The experimental implementation of our framework is 
straightforward. 
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I. INTRODUCTION 

The determination of the Cabibbo-Kobayashi- 
Maskawa (CKM) matrix element \V u b\ from inclusive 
scmilcptonic B — y X u lv decays suffers from large 
B — > X c lv backgrounds. In most regions of phase space 
where this background is kinematically forbidden, the 
hadronic physics enters via unknown nonperturbative 
functions, so-called shape functions. At leading order 
in Aqcd/to&, there is only one such function, which 
can be extracted from the photon energy spectrum in 
B — > X s j 0, [U and used to predict various B — ► X u £v 
spectra. In B — > X s ~/ it is the main uncertainty in 
the effect of the cut on the photon energy, which is 
near the border of the region where a local operator 
product expansion (OPE) is applicable. Due to exper- 
imental cuts, the shape function is also important for 
B -> X s £+£- 0,13. 

The determination of \V u b\ received renewed attention 
recently, since the measurement of sin 2/3 favors a some- 
what smaller value of \V u b\ than its determination from 
inclusive decays. One of the most sensitive tests of the 
standard model flavor sector comes from comparing the 
sides and angles of the unitarity triangle, so it is impor- 
tant to determine \ V u b\ with minimal model dependence. 
Refined calculations of the B — > X s -f rate [5j also provide 
stringent constraints on new physics. 

To obtain \V u b\ as precisely as possible, one should 
combine all existing information on the shape function. 
The shape function is constrained by the measurements 
of the shape of the B — ► X s "f photon energy spec- 
trum p, 0, [I[ and the mx spectrum in B — > XJLv Q , 
and its moments are related to the b quark mass, nib, and 
nonperturbative matrix elements of local operators in the 
OPE, which are constrained by fits to B — * X c lv de- 
cay distributions [l(| EH EH ■ In addition, the tail of the 



shape function as well as its renormalization group evolu- 
tion (RGE) can be calculated perturbatively. A problem 
is that an arbitrarily small renormalization group run- 
ning of the shape function develops a perturbative tail 
whose moments diverge [l3j |. 

Currently, there are several approaches to determine 
\V u b\- Often a model for the shape function is chosen, 
which has a fixed functional form roughly consistent with 
the B — y X s j spectrum, and a few adjustable parame- 
ters that are fixed by imposing constraints on the first few 
moments of the shape function. A proposal [l4[ used by 
many experimental analyses involves defining moments 
of the shape function with a cutoff, and a particular 
procedure to attach a perturbative tail to the model. 
Unfortunately, there is no clear way to disentangle the 
shape function and nib dependencies in this approach, 
and experimental uncertainties in the shape of the mea- 
sured B — y X s ~f spectrum are not easily incorporated. 
The issues related to modeling the shape function can 
be avoided using model independent relations between 
B —y X u lT> partial rates and weighted integrals of the 
B —y X s j spectrum [IE EH EH EK , which use the mea- 
sured B —y X s j spectrum directly as input to predict the 
B —y X U £D rates. Although this weighting method allows 
one to take into account the experimental uncertainties 
from B — > X s ~{ straightforwardly, it is hard to combine 
several measurements, and there is no way to include the 
additional constraints on nib and the heavy quark effec- 
tive theory (HQET) matrix elements. Phase space cuts 
for which the rate has only subleasing dependence on 
the shape function are also possible [lj|, at the expense 
of increasing the size of the expansion parameter. 

A fully consistent method to combine all experimen- 
tal constraints on both the shape and moments of the 
shape function, while also incorporating its known per- 
turbative and nonperturbative behavior, has not yet been 
given. The framework proposed in this paper provides 
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such a method. It also allows one to obtain reliable er- 
ror estimates by (i) taking into account all experimental 
and theoretical uncertainties and correlations; and (ii) 
estimating the uncertainty related to the unknown func- 
tional form of the shape function in a systematic fashion. 

The shape function, S(ui,fi), contains nonpcrturbative 
physics and obeys the renormalization group equation 

S(iJ,Hi) = J dJ U s (u> - uj' , fa, pa) S(u)',ha) , (1) 

where the evolution kernel Us(u>, Hit Ha) sums logarithms 
between the two scales Hi > I 1 A- The question is how to 
determine the function S(u>, h) reliably, which can then 
be used to extract \V u b\ from B — > XJLv, and to analyze 
B — * X s ^f or B — > X s £ + £~ in the low-g 2 region. In 
Ref. 3 we used the construction 



S(u,ha) 



dfcC (w - k,HA)F(k) 



(2) 



where Cq{u),ha) is the b quark matrix element of the 
shape function operator calculated in perturbation the- 
ory, and F(k) is a nonperturbative function that can be 
extracted from data. Equation ((2|) has many advantages 
compared to earlier treatments of the shape function. It 
ensures that: 

1. S(u),(1a) has the correct ha dependence and RGE. 

2. S(u,ha) has the correct perturbative tail at large 
u>, while for small u> it is determined by F{k). 

3. The moments of F(k) exist without a cutoff and 
F(k) falls off exponentially at large k. 

4. Information about matrix elements of local opera- 
tors in any short distance scheme can be incorpo- 
rated via constraints on moments of F(k). 

A construction similar to Eq. ([2]) was also used to treat 
the soft function that describes nonperturbative radia- 
tion in jet production in Ref. (20| . 

The outline of this paper is as follows. In Sec. |TT] we 
set up our notation and discuss how the shape function 
enters the decay rates. Our new treatment of the shape 
function based on Eq. @ is discussed in Sec. IIIII includ- 
ing the procedure for incorporating moment constraints 
in any short distance scheme and an analysis of perturba- 
tive corrections in the shape function and decay rate up 
to two- loop order with a resummation of large logarithms 
at next-to-next-to-leading-logarithmic (NNLL) order. In 
Sec. [IV] we introduce a systematic expansion of F(k) in 
terms of a suitably chosen set of orthonormal basis func- 
tions, which allows one to control the uncertainties aris- 
ing from its unknown functional form. In Sec.fVl we sum- 
marize our proposal of how to use all the available data to 
extract the function F(k) and determine the shape func- 
tion S(lo,h) with reliable uncertainties, which can then 
be used for the extraction of \V u b\ from B — > X u lv. Sec- 
tion lVII contains our conclusions. Details on perturbative 
corrections and the invisible scheme for the kinetic energy 
matrix element are summarized in three appendices. 



II. THE B X s -y AND B -» X u li> RATES IN 
THE SHAPE FUNCTION REGIONS 

We use the kinematic variables p x = Ex T \px \ ■ We 
also define the partonic variable p~ — p x + mj, — tub ■ In 
B — > Xj7, p x = m B (jp~ = m b ) and p\ = m B - 2E 1 , 
while in B — > X u £v they are independent variables with 
p x < p x < tub- There are three cases where it is known 
how to carry out a systematic expansion of the decay rate 



1) Nonpert. shape function 

2) Shape function OPE : 

3) Local OPE : 



Aqcd ~ P x < Px > 
Aqcd < Px < Px > 
Aqcd < P x ~ Px ■ ( 3 ) 



The region 2) was first studied in B — * X u lv in Refs. [2l|, 
[Til, and for the B -> X s ~/ rate in Refs. (H H^. In the 
SCET regions 1) and 2), where p x <^P X , the decay rates 
T s = T(B -> X sl ) and T u = X u lv) are given by 

the factorization theorems [li, 25 1 



dr s 

dE-, 



dr„ 



dEf dp x dp x 



2T 03 H s {p+,Hi) (4) 

x J dcjm h J(m h u;, Hi) S(p x ~ uj, Hi) , 

T Qu H u (E e ,p x ,p x ,Hi) 

x dui p~ J (p~ u, Hi) S(p x -w, Hi), 



where 



ros — 



87T 3 47T 



Way. 



* 1 2 



tbV ts \ , 



1927T 3 



\V ub 



(5) 

Corrections are suppressed by Aqcd /mb and it is known 
how to include them in Eq. (J3J). Here we focus on the 
leading term since the procedure to incorporate the sub- 
leading terms follows the same method. The integration 
limits are implicit in the support of the 5* and J functions 
in Eq. ([¥]) , which are nonzero when their first argument is 
positive. Both, the hard functions, H u and H s , and the 
jet function, J, in Eq. (0| are calculable in a perturbation 
series in a s . We summarize results for them in App. [5] 
up to two-loop order. Only H u and H s are process de- 
pendent, and they can sum logarithms between the hard 
scale and /if ~ P X P X ■ 

The shape function S(u>, Hi) m Eq. ((3]) sums logarithms 
between Hi and ha through Eq. |T]), where in case 1) in 
Eq. (O HA ~ 1 GeV, while in case 2) ha ~ p\- In case 
1) the shape function is nonperturbative, while in case 2) 
it can be computed with an OPE to separate the scales 
Aqcd <SC p x ■ A key feature of Eq. @ is that it makes the 
expressions for the decay rates in Eq. simultaneously 
valid both for cases 1) and 2). For example, it allows an 
analysis of the photon energy cut in B — > X s j without 
having to rely on the expansion in region 2) in Aqcd/Px' 
as was done in Refs. [22, 23]. This is important, since in 
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practice, if the momenta in region 2) are not well sepa- 
rated numerically, the utility of expanding in ^qct>IPx 
is unclear. 

It is possible to make Eq. ([JJ valid for region 3) of 
Eq. ([3]), by including appropriate power suppressed and 
perturbative corrections. At tree level this was carried 
out in Refs. [H, [S3|> and an results presented below are 
valid in region 3) at this order. At the level of pertur- 
bative corrections these issues were studied in Ref. [28| . 
We do not include the additional perturbative correc- 
tions needed in region 3), since our primary interest is to 
study the SCET regions 1) and 2). However, it is impor- 
tant that for the perturbative corrections to be correct 
in region 3), it is required to scale up the different /x's so 
that ha = Hi = fib ~ nib, because there is only one scale 
H in the local OPE. A procedure to carry out this scal- 
ing of the h' s is discussed around Eq. (|36|) . A dedicated 
study of the transition to region 3) is left for future work. 

Combining Eqs. (JTJ) , and ©, and switching the 
order of convolutions we arrive at 



dr s 



2T 0s H s (pj c ,H l 



dr„ 



x J dkP(m b ,Px -k,Hi)F(k), 

_- = r 0u H u (Et,px,Px,m) 

dtLg dp x dp x 

x fdkP(p-, Px -k,Hi)F(k). (6) 

Here, the perturbatively calculable function P is process 
independent, 

P(p~,k,Hi) = Jdu fdu'p~ J\p~(k — w),Hi] 

xU s {uj-uj',Hu^A)Co{uj\Hh)- ( 7 ) 
At lowest order in perturbation theory 

P{p-,k,Hi) = S(k) + 0(a s ), (8) 

and the result for P up to 0(a 2 s ) with NNLL resumma- 
tion is given in App. [A] 

Equation (J5]) can be used to determine the F function 
by fitting to experimental B — > X s ^ and B — > X u £v data. 
We return to this in Sec. [V] In the next two sections 
we explore Eq. j2j) in detail and construct a complete 
orthonormal basis for F(k) that is suitable for carrying 
out these fits. 



III. GENERAL TREATMENT OF THE SHAPE 
FUNCTION 

A. Master formula and OPE constraints 

The shape function S(cu, h) is the B meson matrix el- 
ement 

S(u,/i) = (B\O (lo,h)\B) = (O (lu,h))b, (9) 



of the operator 

O (u,ii) = b v 5(iD+-5 + u)b v , (10) 
where we defined 



m B - m b . 



(11) 



Here, b v is the HQET b quark field and \B) is the full 
QCD B meson state, respectively. (If we used the HQET 
\B V ) state, this would correspond to absorbing time- 
ordered products of Oq{u>) with all power corrections 
in the HQET Lagrangian into the definition of S(u>,h)-) 
Also, D + = n-D, v is a timelike vector, and n is a lightlikc 
vector with n-v = 1. For our application, v = pb/ttib is 
the four- velocity of the B meson, and ft = px/\px \ is the 
direction of the light-quark jet. The h dependencies of 
S(u),h) and Oq(uj,h) are the same. In Eq. pU|) . our use 
of 5 = rriB — mb and the \B) state ensures that S(ui, At) 
has support for ui > with any mass scheme for nib [27j | . 
Note that d explicitly depends on the mass scheme. We 
will use the pole mass scheme first, and discuss convert- 
ing to short distance schemes in the next subsection. 

The information about the shape function that can be 
obtained from perturbation theory arises from the fact 
that when integrated over a large enough region, < 
lu < A, such that perturbation theory is reliable at the 
scale A, the operator Oq(oj) can be expanded in a sum of 
local operators, 



Q (U),H) = ^ C n (OJ, H) Qn + ■ ■ 
2 

= ^ Cn(v - Qn 



(12) 



n=0 



where we use either of the operator bases 



Q n = b v (iD+-S) n b v , Q n = b v (iD+) n b v . (13) 

The Wilson coefficients C n (ui) for these two bases are 
equivalent, because Oq(u)) only depends on the combi- 
nation iD + — 8 + lu. The ellipses in Eq. (fT2"|) represent 
operators of dimension six and higher (where four-quark 
operators first appear). Taking the B meson matrix ele- 
ment of Eq. (TT2"|) . 



(14) 



S(uj,h) = X! c n{u,H) (Qn)B 



the moments of S(uj, h) with an upper cutoff can be com- 
puted 

pA pA 

/ duj uj k S(lj,h) = y)(Qn)fl/ duj uj k C n (uj 7 H) + ■■■ , 
Jo n Jo 

(15) 

and are determined by the local matrix elements (Q u )b 
plus the perturbative information in the C n [Tij . For the 
first few matrix elements, we have 

(Qo)b = 1, (Qi)b = S, (Q2)b = -^ + S 2 , (16) 
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where Ai = (B\b v (iD) 2 b v \B), with the matrix element 
defined in dimensional regularization. With this defini- 
tion and the pole mass, the matrix elements in Eq. (fl6|) 
are [i independent. 

The matching coefficients C n (oj,fi) in the OPE in 
Eq. IT2|) can be determined at fixed order in perturbation 
theory by taking a partonic matrix element of both sides 
of Eq. (JEj). Consider [H[ 

(b v \Oo(u>+S, ft)\b v ) = C n (cu,n) (b v \Qn\b v ) =C (w,/x), 

n 

(17) 

where the b v states have zero residual momentum, and 
we used (b v \Q n \b v ) = 5o n . The n > 1 matrix elements 
vanish in MS because there is no dimensionful quantity 
they can be proportional to. To determine C n (uj,n) for 
n > 1, consider the matrix element between b v states 
with residual momentum k* 1 where v ■ k = but k + = 
n-k^O. The right-hand side of Eq. (12]) gives [H| 



(b v (k+)\O (u + 5, fi)\b v (k+)) 

= (b v (0)\O (u + k + + 5, f*)\b v (0)) 

kl d"C (a;,//) 
n\ dtJ n 



C Q (uj + k + ,n) = V-t' 

A ✓ « 



(18) 



Comparing this with the left-hand side of Eq. ([12 
(b v (k+)\0 (co + S, n)\b v (k+)) 

2 

= C n(^' M) (b v (k + )\Q n \b v (k+)) + ... 



n=0 
2 



n=0 

gives for n = 0, 1, 2 



C n (uj,fx) 



1 d n C {uj,y) 
n\ duj" 



(19) 



(20) 



In Eq. (Tl9|) . the matrix elements of Q„ for n < 2 in 
MS are given by their tree- level values, fc", because loop 
graphs have dimension > 1, but are scaleless and vanish. 

The coefficients of Qi and Q 2 are related by Eq. (|20|) 
to the same perturbative coefficient function as Qq to all 
orders in perturbation theory. This is no longer the case 
at dimension six and higher, where the operator basis 
includes four-quark operators, and more than one matrix 
element must be computed. 

Our key point is to write the renormalized shape func- 
tion at the scale fj, as in Eq. ^ , 



S(u,n)= dkC (co-k,fj,)F(k) 



(21) 



where the function Co(k>,/i) = (b v \Oo(u> + 5,/j,)\b v ) has 
an expansion in a s and contains perturbatively accessi- 
ble information about S(w,[i). Equation (pH]) defines the 
function F(k), which is a nonperturbative object that can 



be extracted from data. To see that Eq. (|2"Tj) uniquely 
specifies F(k), note that in Fourier space S(y,fi) — 
C (y,fi)F(y), so F(y) = S(y, n)/C {y, m)- 

An important feature of Eq. (|21[) is that it is consistent 
with the OPE result in Eq. (Tl4|) . Expanding its right- 
hand side in k gives 



1 d n C Q (uj,n) 
n! duj n 



dk{-k) n F{k), (22) 



and comparing with Eqs. (fT4|) and (|20|) one finds for n 
0,1,2 



dfc fc"F(fc) = (-1)™ (Q n ) E 



(23) 



Thus, the first few moments of F(k) are determined by 
the matrix elements of the local operators, {Qu)b, repro- 
ducing the OPE for these terms. (The decomposition 
in Eq. (|2Tj) can be extended such that it works for the 
OPE terms with n > 3 as well, although we will not do 
so explicitly here.) Unlike for S(u>,ii), for F(k) all mo- 
ments without a cutoff exist, so F(k) falls faster than 
any power of k at large k. Furthermore, the characteris- 
tic width over which F(k) has substantial support is of 
order Aq Cd . 

Equation (f2Tj) also ensures that S(uj, /x) has the correct 
dependence on [i in the MS scheme, since it is determined 
by Co(w,/i), which satisfies the shape function RGE in 
Eq. (P). As shown in Eq. (|A23j) of App. [A] a simple 
formula valid to all orders in a s can be derived, which 
combines S(u>, from Eq. (|2ip with the evolution from 
fiA up to [it in Eq. (P), 

oo j + 1 

S(u,iH) = E s (w,fH,iJLA)Y2 ( 24 ) 
j=-i e=-i 

pi 



a a {n A ),—\ I dz£%(z)F[w(l-z) 



x Sj 



Here, Cj(z) = [\n e (z)/z 1 ^} + is defined in Eq. (JB9J), and 
Sj, Eg, rj — T)(ni, ha), and Vj?(r)) are given in Eqs. (|A12I) . 
(jA16jl . (|A17|) . and (|B17|I . respectively. The ^ coef- 
ficients are determined by partonic fixed-order calcula- 
tions of Co(w,/i), and at ©(a^) the sum is bounded by 
j < 3. The RGE factors 77 and -Eg are determined using 
the anomalous dimensions at various orders. The RGE 
here has Sudakov double logarithms, implying that the 
"cusp" anomalous dimension, r cusp , must be included at 
one higher order than the standard anomalous dimen- 
sions, "f x . There is no large hierarchy between the scales 
/it > Mi > MA) so the optimal procedure for combining 
fixed order and resummation is debatable. We adopt the 
following conventions for our analysis: 

LL : 1-loop r cusp , tree-level matching ; (25) 
NLL : 2-loop r cusp , 1-loop "f Xl 1-loop matching; 
NNLL : 3-loop r cusp , 2-loop j x , 2-loop matching. 
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fi A = 2.5 GeV 
Ma = 1.8 GeV 
Ma = 1.3 GeV 
Ma = 1-0 GeV 




1 1.5 

w [GeV] 

FIG. 1: Scale independence of our shape function construc- 
tion. The thin black dashed curve with the highest peak 
shows the model function S{u,(jh) = F ulod (ijj) given in 
Eq. (|34|1 . The other dashed curves show the result of tak- 
ing S(to,(i A ) = F mod (w) at (m a = 1.0, 1.3, 1.8 GeV (from bot- 
tom to top near the peaks) and running up to (a — 2.5 GeV 
with NNLL accuracy. The solid curves use Eq. Q240 with 
(ma = 1-0, 1.3, 1.8, 2.5 GeV and running up to (n = 2.5 GeV 
at NNLL. Note the stable tails of the solid curves. The dip 
at small ui is discussed in the next section. 



Our construction ensures that S(ui, (Mi) has the correct 
perturbative tail for large u>, specified by both the con- 
stant and logarithmic terms in Co(w,Mi)i while for small 
d it is controlled by the nonperturbative function F(k). 
Thus Eqs. and build in all the information that 
can be obtained from considering region 2) in Eq. ([3]) 
without having to carry out an expansion of the decay 
rate in this region in Aqcd/.Px- 

A common approach for modeling a parton distribu- 
tion function is to specify a model for S(u),(ma) at a fixed 
scale /iA and then run it up to a higher Mi- In this case, 
Ma must be treated as a model parameter, and changing 
it can cause significant changes in the model. Further- 
more, for lo 3> Aqcd the perturbative tail of S(u},(Mi) 
obtained in this approach will not be consistent with 
carrying out the OPE at Mi- The perturbative loga- 
rithms are reproduced, but the constant terms are not 
(and the latter are sizeable contributions to the OPE for 
the scales considered here). In Fig. [T]we compare this 
approach (dashed curves), with the superior approach of 
describing the shape function via Eq. (f2"4"]) (solid curves) 
at NNLL. In each case we use a model for F(k), F mod (k) 
given below in Eq. (|34p. whose first three moments cor- 
respond to m b = 4.7 GeV and A x = -0.31 GeV 2 . The 
thin black dashed curve shows S(u>,Mi) = F mod (oj). 
The other three dashed curves show the result of fix- 
ing S(uj,(m a ) = F mod (Lo) at (ma = 1-0, 1.3, 1.8 GeV (from 
bottom to top near the peaks) and running up to Mi — 
2.5 GeV using Eq. ([T]). The resulting tails at large lo are 
clearly inconsistent with each other. The solid curves 
show the result of our approach, using Eq. (|24|) to obtain 



S(oj,(ma) at (ma = 1.0) 1-3, 1.8, 2.5 GeV and running up 
to (Mi — 2.5 GeV. In our approach, S(ui,fj,i) is indepen- 
dent of the initial scale ma, up to subleading corrections 
in Q! s (ma)) and the tails at large u> are consistent with 
one another. The negative dip at small lo is an artifact 
of using the pole mass scheme, and will be removed by 
switching to short distance schemes in the next section. 
Another feature of the solid curves in Fig. Q] is that their 
tails become negative for lo > 2.5 GeV. It was noted in 
Ref. 14[ that most of this negative tail is canceled by the 
perturbative corrections from the jet function. We dis- 
cuss in Sec. IIIICI that this negative tail also disappears 
if ma is increased as lo increases. 

To obtain the correct perturbative tail, the procedure 
used by BLNP [28| for \V u b\ analyses is to take the pertur- 
bative computation of C${lo,(m) for lo > loq and a model 
for S(lo,(m) for lo < loq, and these two pieces are glued 
together, choosing lo$ so that the result is continuous. 
The advantage of our construction in Eq. (f2"Tj) is that the 
tail automatically turns on in a smooth manner when it 
dominates over the nonperturbative function F(k) and 
provides the proper (m dependence for S(lo,(m) at any lo. 

Imposing the moment constraints on F (k) in Eq. (j2"3")l 
provides a clean way to incorporate the information 
on the local OPE matrix elements, nib, Ai, etc., from 
B — > X c £p. It is possible to use a shape function scheme 
in which moments of S(lo,j£i with a cutoff define the non- 
perturbative parameters [14} . Our approach has the ad- 
vantage of allowing one to use any desired short distance 
scheme, as we discuss next. 



B. Short distance schemes 

The most precise information on the matrix elements 
in Eq. ([TB|) is provided by fitting OPE results to B — > 
X c lv decay distributions. This directly constrains F(k) 
through Eq. ([2"3")1 . Ideally, we would like to incorporate 
these constraints on F in a manner that is independent 
of the order in perturbation theory used to calculate P 
in Eq. ([7]). However, if we define the moment parameters 
6 and Ai from Eq. (|16[) in an infrared sensitive manner 
such as the pole mass scheme, then the dependence on 
the order in a s will not be small — infrared renormalon 
ambiguities in the perturbation series will cancel against 
ambiguities in the parameters S and Ai. In practice, this 
means that the values of 5 and Ai may change substan- 
tially when the fit in B — > X c iv is done at different orders 
in perturbation theory. In B physics, cancellations be- 
tween perturbative corrections are significant already at 
low orders in perturbation theory. Thus, it is preferable 
to define F(k) and Cq{lo,(m) so that they are individu- 
ally free of renormalon ambiguities, which should make 
the values of 8 and Ai more stable to the inclusion of 
perturbative corrections. 

Consider shifting to a new perturbative kernel Cq{lo, (m) 
and nonperturbative function F{k) that are free from 
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renormalons. To implement this we let 

CoH=C (w) + *C (w) ) 
F(k) = F(k) + 6F(k) , 

such that 



(26) 



S{u)= dkC (uJ -k)F(k) = / dk C (w - k) F(k) 



(27) 

These shifts move a series of perturbative corrections be- 
tween F(k) and Cq(w,/z). The shift 5F(k) will be chosen 
such that the moments of F{k) are given by renormalon- 
free parameters, and Eq. (|27|) then determines the corre- 
sponding shift SCo(lo). 

We switch from the pole mass nib and Ai to short dis- 
tance parameters, fhb and Ai, 



m b = fh b + Srrib , 



Ai = A, 



<5Ai 



(28) 



where Snib and SXi consist of series in a s (fi) with the 
same renormalon as nib and Ai, respectively. The free- 
dom to choose these series corresponds to the freedom to 
choose different schemes for fhb and Ai. To obtain F(k) 
with moments only depending on fhb and Ai, we pick 
SF(k) such that 

SF(k) = F(k) - F(k - 8m b ) - AF(k) , 
F(k) = F{k - Sm b ) + AF(k) . (29) 

Shifting the argument allows us to switch to fhb- To 
implement Ai, AF(k) has to satisfy 

JdkAF(k) = JdkkAF(k) = 0, 

Jdkk 2 AF{k) = ^i. (30) 



The most general solution to Eq. (f30|) is AF(k) = 
(<5Ai/6) F"(k), where F±(k) is an arbitrary function, nor- 
malized as J dkFi(k) = 1. 

Using Eqs. ^ and ((30J with Eq. 1(23]). one can eas- 
ily check that F(k) has renormalon-free moments, as de- 
sired, 



dkF(k) = 1, 
J dkkF(k) = S + 5m b = S, 



dkk 2 F(k) = -— + 5 2 . 

3 



(31) 



where i5 = nis — nib- Thus the experimental values of 
the b quark mass and Ai extracted in any short distance 
scheme can be used as inputs in our framework via the 
moment constraints in Eq. (|3ip . 

To determine the corresponding shift in Cq(ui), we note 
that Eq. (H7J) implies J dfc [C (cj - k) SF(k) + 5C (uj - 



Parameter 


Value 


a a (m z ) [29] 
a s (4.7GeV) 
m B 


0.1176 
0.2155 
5.279 GeV 


m\ s [12] 
Ai [1ZJ 


4.70 GeV 
— U.ol Lxev 


Ai 


-0.32 GeV 2 


mf n (l GeV) 
A^ in (lGeV) 


4.57 GeV 
-0.47 GeV 2 



TABLE I: Central values of the input parameters. The last 
three entries are computed from m\ s and Ai at order a 2 . 

k) F(k)] = 0. To solve for SCq(u>), we take the Fourier 
transform, SCo(y) = —Co(y) 8F(y)/F(y), and thus 



SC (y) 



6 y F(y) 



C (y). (32) 



Here, any 5nib 5Xi cross terms only have higher-order 
renormalon ambiguities and are dropped. Any choice of 
Fi(k) is equally good for incorporating the SXi shift. We 
adopt the simplest choice Fi(k) = F(k), which is unique 
in that it keeps Cq{u>) independent of the precise form of 
F(k). 1 In this case, in momentum space, we have 



C (u 



s~t t r \ ^Al d 2 . . 

= G {u) + drub) — ^2 



1 + Snib 



du 



gSnib) 2 



6X1 
~6~ 



d 2 i 

diJ 2 



(33) 



where Cq(oj) is determined by the perturbative calcula- 
tion of (b v \O (i^ + S, n)\b v ), and the ellipsis denotes terms 
that are either O(o.g) or beyond the order we are work- 
ing at for the moments of F(k). The same strategy to 
determine F(k) and derive the corresponding Cq(u>) can 
be applied to higher moments if in the future terms with 
n > 3 in Eq. (|23p are included in the analysis. 

In the remainder of this paper we use the convolu- 
tion formula for S(u>,fj,) in terms of Co(w,/u) and F(k) 
in Eq. (|2"T|) . We shall regard F(k) as the fundamental 
nonperturbative object to be extracted from data. In 
particular, in Sec. IIVI we will build a complete basis of 
functions for F(k). 



1 There are other possible choices. For example, taking -Fi(fc) oc 
k' 2 F(k) would ensure that the <5Ai shift does not change the 
small-fc behavior of F(k). However, if F\(k) ^ F(k), the shift 
&Cq{u)) depends on F(k), and must be recomputed each time 
F(k) changes. Hence other choices are more difficult to imple- 
ment when performing a fit to data to extract F(k). 
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FIG. 2: S(uj,n,i) obtained from Co(u;,/xa) with /xa = 1.3 GeV 
and run up to /x^ = 2.5 GeV at NLL (dashed) and NNLL 
(solid) order. Shown are results using the pole mass scheme 
and the 15* and kinetic short distance mass schemes. Switch- 
ing to the short distance schemes, the result becomes more 
stable going from NLL to NNLL, the negative dip at small uj 
in the pole scheme is removed, while the perturbative tail at 
large u> remains unchanged. 



C. Numerical results for S(lj, /x) 

To illustrate the effect of using our method to in- 
clude the perturbative corrections to the shape function 
through Co we choose a model for F(k), 



F mod (fc) 



(34) 



Here A = 0.8 GeV, f n (x) is given below in Eq. 
and the three parameters, cq, ci, and c 2 , are fixed 
to satisfy the constraints in Eq. (|31[) for the ap- 
propriate short distance scheme. For our numer- 
ical analysis we use the input values collected in 
Table HI With {ml s , Ai} we have {cq, ci, c 2 } = 
{0.949, -0.309, 0.064}, while for {m£ s , A^} we have 
{c , ci, c 2 } = {0.949, -0.307, 0.075}, and for {m kin , A x } 



we have {c , c u c 2 } = {0.988, -0.120, -0.095}. 

First, we switch from the pole mass to a short distance 
mass. We use the IS mass [3(| and the kinetic mass [3l| 
schemes, 



6 = 5 1S = m B 



,is 



6 = 5 kln = m B - m k b 



(35) 



to fix the first moment in Eq. (|3ip. and Ai to deter- 
mine the second moment. The choice of mass scheme en- 
ters Co (a;) through the <5m& and <5Ai in Eq. (f3"3")> . which 
must be expanded in a s to avoid the renormalons. De- 
tails of the implementation of short distance schemes and 
the expressions for 5m\ and 5m kln are discussed in Ap- 
pendix IA 31 In Fig. [2] we show the result for S(u, /x^) 
obtained from Cq(w,/xa) with /xa = 1.3 GeV, run up to 




0.2 0.4 0.6 0.8 1 

w [GeV] 



1.2 1.4 1.6 



FIG. 3: /xa dependence of S(ui, 2.5 GeV) in the peak region 
obtained from Co(w,/xa) at LL (dotted green), NLL (dashed 
blue), and NNLL (solid orange) order, using m\ s and Ai. The 
three curves in each case are for /xa = 1.0, 1.3, 1.8 GeV. The 
/xa dependence is significantly reduced at each higher order. 



Hi = 2.5 GeV at NLL order (dashed) and NNLL order 
(solid). The blue (dark), orange (medium), and green 
(light) curves show the results in the pole, 15, and kinetic 
mass schemes, respectively. In both short distance mass 
schemes the negative dip present in the pole scheme is 
removed, while the perturbative tail at large u> remains 
unchanged. The removal of the negative dip is similar 
to what was observed for the soft function for jets in 
Ref. [H- 

In Fig. [3] we illustrate the perturbative convergence and 
residual /xa scale dependence of the result of Eq. (|2~4")) or- 
der by order, using the IS mass scheme and Ai. We 
show S(u>,fAi = 2.5 GeV) run up from /xa at LL (dot- 
ted green), NLL (dashed blue) and NNLL (solid or- 
ange). For each order, the three curves correspond to 
Hi\ = 1.0, 1.3, 1.8 GeV. As expected, the /xa depen- 
dence is significantly reduced by going from LL to NLL 
to NNLL. For the lowest scale, /xa = 1-0 GeV, an os- 
cillation begins to build up at small lo, which is clear 
at NNLL where the curves for the two larger scales are 
quite stable. Although we continue to explore /xa as 
low as 1.0 GeV in this section, we take this as evidence 
that slightly larger values of /xa should be used to ensure 
a convergent expansion for Co- Therefore, we will use 
fj, A = 1.2, 1.5, 1.9 GeV in Sec. ILnDl for the decay rate. 

Next, we switch to short distance schemes for Ai. Note 
that the NLL results in Fig. [2] with Ai defined in dimen- 
sional regularization are already quite stable. Unlike for 
the pole mass, there is not much numerical evidence for 
the importance of switching to a short distance scheme 
for Ai, and adding a sizable <5Ai ~ 0{a s ) correction may 
oversubtract. The u = 1 renormalon in Ai is related to 
the large-order behavior of perturbation theory, and it is 
unclear how much numerical impact it has on the per- 
turbative coefficients computed at 0(a s ) and 0(a 2 s ). In 
App.[C]we show that schemes with 8\\ = 0{a s ) appear 
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0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 
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FIG. 4: The effect of using the invisible scheme for Ai on 
S(ui, 2.5 GeV) at NNLL order. The orange solid lines are the 
same as those in Fig. [3j using m\ s and Ai, while the light 
blue dashed lines use m\ s together with Ai . The three curves 
in each case are for the same values of /m as in Fig. [3] 

to oversubtract, causing large oscillations in the shape 
function at small lo. This is shown explicitly for the ki- 
netic scheme Ai in (= — fi%). Therefore, in App. [Cl we in- 
troduce a new short distance scheme with 8X\ = 0(a 2 s ), 
which we call the "invisible" scheme and denote by \\. 
The expression for 8\\ is given in Eq. (|A37[) . In this 
short distance scheme, the NLL results are unchanged. 
In Fig. 2] we compare results at NNLL for m\ and Ai 
versus using m\ s and X\. One sees that the invisible 
scheme has only a small effect on the NNLL shape func- 
tion, which is entirely at small lo. It damps the oscillation 
that occurs when fi A — 1.0 GeV and modifies the slope. 

In Fig. [5] we show the (i A scale dependence at NLL 
(dashed) and NNLL (solid) order in the tail region. The 
lower six curves use the same scale variation as in Fig. [3] 
Here, as one uses the SCET expansion, but enters the 
local OPE region, the scale dependence increases with u, 
and the tail becomes negative. This is due to increasing 
\q.{lo / fi A ) terms, for which the above choices of fi A are 
inappropriate. To avoid potentially large logarithms, we 
can increase fi\ as we increase to by taking, for example, 

/j.' a (lo) = a + b arctan(w — 2.5 GeV) . (36) 

To vary the scales, we take three functions of this form, 
with a and b chosen such that for lo = they give 
H' A = 1.0, 1.3, 1.8 GeV, and for u = 4.7GeV they give 
(j,' A = 2.35, 4.7, 9.4 GeV. In Fig. the six upper curves 
show S(u),fjLi) obtained with these Ma( w ) choices, tak- 
ing jii = n'i(to) = [h' a (lo) m^ 5 ] 1 / 2 with the central /i' a (lo). 
Comparing the upper and lower curves shows that in- 
creasing fj, A with lo significantly reduces the fi A scale de- 
pendence to a similar level as in the peak region shown in 
Fig. [3j (We have checked that using the running Ma( w ) 
does not have an effect on the size of the scale uncer- 
tainty in the peak region.) The upper curves in Fig. [5] 
also have a much less negative tail, which is caused pre- 




1.5 2 2.5 3 3.5 4 4.5 

us [GeV] 



FIG. 5: fiA dependence of 5(tJ,2.5GeV) in the tail region 
(lower six curves) and of S(u,fii) (upper six curves) at NLL 
(dashed) and NNLL (solid) order, using m\ s and Ai. For 
the lower six curves we vary (ia ~ 1-0, 1.3, 1.8 GeV, while for 
the upper ones we use the running scale parameters fj!\{ui) in 
Eq. (|36p and /4(<^) described in the text. 

dominantly by increasing fi A with lo, and only partially 
by increasing the reference scale fa. This means that the 
dominant part of the negative tail in the lower curves is 
caused by large logarithms of lo/ fa\. Hence, increasing 
fi A with lo will be important to obtain a positive rate in 
the tail region. The importance of increasing /i with a 
kinematic variable in the tail region was also pointed out 
for jet production in Ref . [32j , where it was required to 
avoid a negative tail for the cross section. 

In the decay rate the fa dependence of S(u>, fa) cancels 
against corrections from the jet function, so the interme- 
diate scale, fa, should also run with increasing lo. This 
is accomplished by taking fa = ^[(lo) = \p! A {u>) Hb} 1 ^ 2 for 
each of the three running /^(w)'s from Eq. (f3"6"f . This 
treatment of the tail has the advantage that we obtain 
Ma = Mi = Mfc f° r w ~ m b, an d by construction the stan- 
dard factor of two scale variation, m&/2 < /!/, < 2m&. 
Hence it is consistent with the local OPE treatment for 
region 3) in Eq. ([3]). The rate at which equality is ap- 
proached can be controlled by multiplying the argument 
of the arctan in Eq. (|3"6")l by a scaling factor, and the cen- 
ter of the transition region can be adjusted by modifying 
the 2.5 GeV value shown there. 



D. Perturbative results for the B — ► X 3 f spectrum 

In this section we explore the scale dependence of the 
perturbative corrections to the B — > X s j spectrum in 
the SCET regions 1) and 2) in Eq. ©. The rat e depends 
on three scale parameters ma ~ 1 GeV, m« ~ \A"b Aqcd, 
and fib ~ rrib- In a short distance scheme the decay rate 
in Eq. ([6]) becomes 

4% = r 0;s .ff s (Mt,M&) U H (m b ,Hb,Hi) 
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x JdkP{m b ,k,^)F{p+-k), (37) 



where H s and Uh , are given in Eqs. dXT|) and (|Xl4]) . and 
the notation P and F indicate that these are given in a 
short distance scheme. From the analysis in App. |Aj all 
perturbative corrections can be organized into a simple 
series of plus distributions £j(z), defined in Eqs. (|B9[) 
and (|B10jl . The integral of the perturbative function P 
with the F function is then 

JdkP(p-,k )f H)F(p+-k) (38) 

= V P J (p-,p+,^, f i A ) / dzC](z)F[p+(l-z)}, 

Jo 

where explicit results for the coefficients Pj are given in 
Eq. (|A25[) . The additional terms generated by transform- 
ing Eq. (f3"8")) into the integral over the short distance P 
and F are described in detail in App. IA 31 

In Figs. O and [5] we study the p\, Hi, and /Lit, de- 
pendencies of the p\ spectrum for B — > X s j, namely 
(dT s /dp+)/[T 0s |Cf ncl (0)| 2 ] . Recall that this has a sim- 
ple relation to the photon energy spectrum, p\ = tub — 
2E~ f . We show the p x spectrum to facilitate easier com- 
parison with the results for the shape function in the 
previous section. Since we are interested in studying the 
perturbative corrections, we keep F mod (k) fixed to be our 
default model in Eq. (JHU). In each of Figs. El and we 
show results at LL (dotted green curves), NLL (dashed 
blue curves), and NNLL (solid orange curves) order, for 
three different values of the scales, and using the IS mass 
and X\ scheme. The central values are fi\ — 1.5 GeV, 
fii = 2.5 GeV, and /x& = 4.7 GeV, two of which are 
held fixed in each plot. In Fig. [5] the three curves at 
each order show /xa = 1.2, 1.5, 1.9 GeV, in Fig. [7] they 
show pi = 2.0, 2.5, 3.0 GeV, and in Fig. |5] they show 
fib = 2.35, 4.7, 9.4 GeV. Since the scales should obey the 
hierarchy /ia < [Mi < //(,, it is not possible to vary each 
by a factor of two. For illustration, we do vary ^ by 
a factor of two, but keep fj,A and fii in ranges suitable 
to the physical shape function and jet function regions, 
respectively. The largest effect going from LL to NLL to 
NNLL is the change in the normalization. Although it is 
not captured by our range of scale variation, the normal- 
ization still exhibits reasonable convergence. Rescaled 
to a common normalization, the shape of the spectrum 
shows very nice convergence, and the range of scales used 
is clearly suitable here. We find a reduction in the scale 
dependence order by order for most values of p\ . 

In Fig. [5] we show three additional NNLL curves (long 
dashed light blue) that use m\ s and Ai instead of X\. 
Comparing these curves to the orange curves {m\ s and 
Ai ) , we see that the overall effect of using \\ instead of 
Ai is small. However, one can clearly observe that the 
invisible scheme improves the perturbative convergence 
of the spectrum at small p x . 
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FIG. 6: /ia dependence of the B — * X 3 "/ spectrum in terms 
of ml s and X\, using P mod (fc). Shown are the LL (dot- 
ted), NLL (dashed), and NNLL (solid) spectra for /ia = 
1.2, 1.5, 1.9 GeV. The long dashed curves are the NNLL spec- 
tra with m\ s and Ai, showing less convergence at small ui. 
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FIG. 7: fii dependence of the B — > X 3 ^f spectrum in terms of 
m} b s and Ai, using P mod (fc), at LL (dotted), NLL (dashed), 
and NNLL (solid) order for m = 2.0, 2.5, 3.0 GeV. 
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FIG. 8: fib dependence of the B — > X a ^y spectrum in terms of 
mf and Ai, using F mod (fc), at LL (dotted), NLL (dashed), 
and NNLL (solid) order for fj, b = 2.35, 4.7, 9.4 GeV. 
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Although we do not show results here for the spectrum 
in the local OPE region 3) in Eq. ([3]), we checked that 
the same effects as in the tail of S(oj, fa) in Fig. [5]appear 
in the tail of the spectrum. To avoid a large /^a depen- 
dence in the tail, the scales must be increased with u> as 
discussed below Eq. (I36|) . and this also helps to obtain a 
positive tail for the spectrum in region 3). 

It is important to emphasize that in the common ap- 
proach to model distribution functions corresponding to 
the dashed curves in Fig. [TJ where a fixed model for 
S(u,ha) is specified at the scale (ia and then run up to 
fa, the spectrum will have a large dependence on fi\, 
which must be considered a model parameter. Thus, 
there is no analog to the independence of the rate 
obtained in our construction, and illustrated in Fig. [6] 
Note that the dependence of the B —* X s j rate for 
Px — (Px)° ut on t ne soft and intermediate scales, fa\ 
and fa, has so far been studied only by performing an ex- 
pansion in A QCD /(p+) cut in region 2) of Eq. © [22, H3], 
as in Eqs. ([12")) and (TT4")) . In this case, one can combine 
the same perturbative ingredients as in our approach, de- 
pending on the three scales fi\, fa, and fib- As mentioned 
before, the advantage of our framework is that it does not 
rely on performing an expansion in region 2). 



IV. EXPANSION OF THE F FUNCTION 
A. Complete orthonormal basis 

So far in the literature the uncertainty related to the 
unknown functional form of the shape function has been 
either neglected or estimated by using a few model func- 
tions for S(ui, fi) , and varying their parameters or distort- 
ing them [H, [28|, HH, subject to constraints on their 
first few moments. To obtain a systematic estimate of 
the uncertainty related to the unknown functional form 
of the shape function, we construct a suitable set of com- 
plete orthonormal basis functions for the function F(k) 
defined by Eq. (|2"T|) . The uncertainty in the functional 
form is then determined by the uncertainty in the coef- 
ficients of this basis. This expansion will also be conve- 
nient to extract F(k) from experimental data, including 
experimental and theoretical uncertainties and correla- 
tions. 

Since F(k) has mass dimension — 1, it is convenient 
to introduce a dimension-one parameter, A, and use the 
dimensionless variable x = k/X. By power counting, 
A ~ Aqcd- We expect on physical grounds that F(k) 
is positive, so we can expand its square root, 



F(Xx) 



71=0 



(39) 



where f n (x) are a complete set of orthonormal functions, 
da; f m (x) f n (x) = S mn . (40) 



Since F{k) is normalized to unity, the coefficients c n sat- 
isfy 

1= fdkF(k)= fdxh^CnUx)] = ( 41 ) 



Although F(k) is independent of the choice of basis func- 
tions f n (x) when summing over all n, in practice only a 
finite number of terms can be kept. Therefore, we want 
to choose basis functions, f n (x), such that the first few 
terms in Eq. (|39[) provide a good approximation to F(k). 
Unfortunately, most of the well-known orthonormal func- 
tions on [0, oo) become broader with increasing n, and 
hence have moments whose values increase with n [35| . 
By dimensional analysis, the n-th moment of F(k) scales 
as Aq CD , and we would like the basis functions to satisfy 
this constraint, at least for the low-n moments. 

To construct a suitable orthonormal basis f n {%) on 
[0, oo), we consider orthonormal functions <fi(y) on [— 1, 1] 
and a variable transformation y(x), which maps x £ 
[0, oo) to y 6 [—1,1], We choose y(x) to be increasing, 
y'(x) > 0. Then, for any orthonormal basis 4> n (y) on 
[-1,1], 



fn(x) = y/y'(x) <j> n [y{x)] 



(42) 



provides an orthonormal basis on s 6 [0, oo). Choosing 
different </>„'s and different y(x)'s allows us to change 
the basis functions. It is natural to choose <fi n (y) to be 
polynomials of degree n, and we find it convenient to use 
the normalized Legendre polynomials 



/?T7 4- 1 1 H" 

My) - Pn(y) , P n (y) = ^^(v 2 - i) n 

(43) 

To determine y(x), note that for a positive definite 
function Y(x), such that J^°dxY(x) = 1, the function 



y(x) = -1 + 2 / dx'Y(x). 
Jo 



(44) 



satisfies y(Q) = —1 and y(oo) — +1. Since y'(x) = 2Y(x), 
the first basis function is simply 



[fo(x)r=y'(x)ti[y(x)]=Y(x) 



(45) 



To obtain a good approximation to F(k) with the first 
few terms in Eq. (|3"9")) . one should choose Y(x) to be "sim- 
ilar" to X F(Xx). This gives an intuition about suitable 
choices, and once Y(x) and A are fixed, the full basis is 
specified. A convenient choice for Y(x) is 



Y(x,p)= iP 1- 1 ^ 1 x*e-(P^ 
T(p + 1) 



(46) 



for any p > real parameter. Constructing a basis from 
Y(x,p) yields basis functions for which the first and sec- 
ond moments of fm(x)f n (x) are order one or smaller. 
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Therefore, the terms in the expansion in Eq. ([39)1 have n- 
th moments of order A" or smaller. Choosing the scaling 
parameter A ~ Aqcd, the first few terms in the expan- 
sion in the resulting basis gives a good approximation to 
the shape function. 

The function in Eq. (|46|) is similar to those used to 
model S(u>, /i) in the literature. In fact, by choosing Y{x) 
to be a specific model shape function, our construction 
allows one to expand about it, and systematically study 
corrections to an assumed functional form. We empha- 
size, however, that in our approach the only role of 4> n (y) 
and Y{x) is to specify the basis functions, /„(x). The 
functional form for Y{x) affects how quickly the expan- 
sion in Eq. (l39|) converges, but not the fact that it is 
a convergent expansion. The completeness of the basis 
4> n {y) on [—1, 1] implies that any square integrable func- 
tion on [0, oo ) can be expanded in terms of the bases 
fn(x) resulting from the above construction. 

Using Eq. (|46[) gives basis functions that behave as 
fn(x) ~ x p as x — > 0. Due to the short distance sub- 
tractions in Eq. (153"[) , to ensure that S(ui,fx) goes to zero 
at oj = 0, we need F(k) to go to zero at least as k 3 for 
k — » 0. Thus, we find it convenient to use Y(x, 3) as our 
default choice, with 



y(x,3) = 1-2 l + 4r 



32 



which gives the orthonormal functions 



fn(x) = 8 



2x 3 (2n+l) 



Pn[y{x,Z)] 



(47) 



(48) 



For numerical calculations we use A = 0.8 GeV as de- 
fault. These functions are also convenient because they 
allow analytic calculations of the decay spectra. The first 
five basis functions in Eq. (f48|) are shown in Fig. [9l In 
Fig. [TO] we use these to illustrate the uncertainty remain- 
ing in F(k) if its first three moments are fixed. We fix C3 
and C4 to 9 different combinations, and choose the first 
three coefficients co,i,2 to satisfy the moment constraints 
in Eq. dSU) - All the 9 functions shown have 0.92 < c < 
0.95, -0.35 < ci < -0.28, and 0.01 < c 2 < 0.14. Even 
this plot makes the uncertainties look smaller than they 
are, since at small k the k 3 behavior of these models ap- 
pears to imply a small uncertainty. Including the short 
distance subtractions from Eq. ([33]) . these models yield a 
significantly wider variation in S(u>,fi) for small to. Fig- 
ure[TOJshows that even with small errors of the B — » X c tu 
moments, more information on the shape function can be 
extracted from the B — > X s j or B — > X u lv data. 



B. Truncation uncertainties 

Equation ([3"9"|) provides a model independent descrip- 
tion of F(k) for any choice of the basis. Since the basis is 
complete for any value of A, we will regard A along with 
the function Y(x) as part of the convention that defines 




FIG. 9: The first 5 orthonormal basis functions in Eq. (J58 



c 3 = c 4 = 

c 3 = ±0.15, c 4 = 

c 3 = 0,c 4 = ±0.15 

s. c 3 = ±0.1,c 4 = ±0.r 




0.2 0.4 0.6 0.8 1 

k [GeV] 



1.2 1.4 1.6 



FIG. 10: Nine functions with identical first three moments. 
For each curve, we fix C3 and c 4 and then choose Co, ei, C2 
to satisfy the moment constraints in Eq. (|31[) with m\ s and 
Ai. The thick solid curve (03 = c 4 = 0) corresponds to the 
default model used in Sec. IIIII 



the basis. In practical applications one has to truncate 
the series in Eq. (|39[) after the first N + 1 terms, 



F< N <{k) 



N 



^ ' c n fr, 



N 



I V C C f (-) f (- 



(49) 



and use F^ N '(k) in the actual calculations. 

Figure [Tl] illustrates how the expansion converges. 
Both plots show a toy Gaussian model function (black 
solid curve) , which we expand in terms of the first n ba- 
sis functions in Eq. (|4"5)l for n < 4. In the left panel, we 
use the default value A = 0.8 GeV to define the basis. 
The truncated series in Eq. ([4"9"ll quickly approaches the 
model function, even for small values of N. In the right 
panel, we show the same expansion using A = 0.6 GeV 



12 




0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 

k [GeV] k [GeV] 



FIG. 11: Left: Expansion of a Gaussian model function F GauBB (k) = (2/a)(fc/a) 3 exp[-(fc/a) 2 ] with a = 0.639 GeV, corre- 
sponding to m,b — 4.7GeV, using the first 5 basis functions, fn(x), in Eq. (|48[1 . Right: Same as on the left, except that in the 
definition of the basis A is changed from 0.8 GeV to 0.6 GeV. 



to define the basis, which illustrates how the value of A 
affects the convergence of the expansion. 

To quantify the uncertainties, we would like to have a 
systematic way to estimate the error due to the neglected 
terms in the truncated sum in Eq. ([4^1) . The truncation 
error is affected by the choice of N, Y(x), and A, pro- 
viding several handles on this uncertainty. A virtue of 
adding one more term from our orthonormal basis, com- 
pared to adding one more parameter to a generic model, 
is that due to the orthogonality of the basis functions 
the additional parameter provides independent informa- 
tion and should avoid large parameter correlations. By 
varying N, one can change the number of coefficients, 
and study how sensitive the results are to the truncation. 
As a consistency check, one can use a different basis, or 
change Y{x) or A and check that the difference is within 
the previous truncation error estimate. 

A feature of our construction is that the truncation 



error can be estimated using "Y^ r 



1. We define 



y/\F(\x) = f(x) = fW(x) + c r f r (x) , (50) 



where 



N s 1/2 



n I ' 



(51) 



and the remainder function, f r (x), is orthogonal to 
/o,..., N (x), 



f r ( X ) = ^ r « fN+n(3 



(52) 



and normalized 



poo °° 

/ dx[/ r (x)] 2 = l, $>2=1. (53) 



In terms of f r {x), the truncation error from approximat- 
ing F(k) by F^ N \k) is 



F, 



(AO 



[f r ](k) = \F(k)-F^(k) 



(54) 



2 c r f r 



Although Eq. (f53|) implies |r„| < 1, it provides no 
bound on X)rJ r ™l- Therefore, one cannot derive a rigor- 
ous bound on |/ r (a0| horn Eq. (|52[) for all x > 0, and it is 
not possible to obtain a rigorous bound on the truncation 

r(N) 

trunc ' 



error, F}„,i„, either. However, for practical purposes, it 



is sufficient to estimate ftrunc- Its size is controlled by 
c r in Eq. (|5Tj) , which is determined by the first N + 1 
coefficients, and can be minimized by choosing a good 
basis. Thus, as long as enough coefficients are included 
in the basis so that the estimated truncation uncertainty 
is small compared to other uncertainties, any reasonable 
estimate of f r (x) provides a useful estimate of the trun- 
cation uncertainty. 

For suitable choices of the basis, we expect the series to 
converge fairly quickly, since the higher basis functions 
oscillate more and more rapidly, and we do not expect 
significant structure in the momentum distribution of the 
b quark in the B meson over momentum scales ^ Aqcd • 
Hence, one way to estimate f r {x) is to assume that one 
term in Eq. (|52p saturates the sum. It is natural to take 
that to be the first (possibly the second) term, which 
corresponds to 



f r {x) = ±f N+1 (x) 



(55) 



Another possibility is to assume that |/r(^)| ^ fo(x), and 
take 



f r (x) = ±/ (x) . 



(56) 



This is motivated by the fact that, except for small and 
large values of a;, fo(x) gives roughly an envelope for the 
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k [GeV] k [GeV] 

FIG. 12: Truncation error estimates. The solid black curves show F Gauss (fc) — F^ N \k) corresponding to the right plot in 
Fig. 1111 for N — 2 (left) and N = 4 (right). The dashed blue curves show ±max {F t ' r ^ c [±/jv+i]}, and the solid orange curves 
show ± max |i ;l t -. u ^ c [±/o]}, i.e., the error estimates using Eqs. (|55[) and (|56[) , respectively. 



higher basis functions, as can be seen from Fig. [9] In 
each case, the sign of f r {x) is undetermined. For small 
cy, the second term in Eq. (fB"4")) can be neglected and the 
sign of f r {x) is irrelevant. 

To illustrate these methods for estimating the trunca- 
tion error we consider again the toy model used in Fig. 111! 
with A = 0.6 in our basis as in the right panel. In Fig.[T2l 
the solid black curve shows F(k) - ^(ife) for N = 2 
(left panel) and N — 4 (right panel). The dashed blue 
curves show the estimate using Eq. (|55|) , taking the point- 
by-point maximum of F t y^ c [±/jv+i](&), while the solid 
orange curves use Eq. (|56p instead. As expected, the for- 
mer correlates more with the absolute value of the point- 
by-point deviations, while the latter roughly envelopes 
the difference. As can be seen, we obtain reasonable esti- 
mates of the deviation of F^ N ^(k) from F(k). The overall 
sizes of the truncation errors are well estimated in both 
plots, because they are proportional to c r . For the left 
plot c r = 0.088 and for the right plot c r = 0.025. If 
the expansion of F{k) converges reasonably fast, then 
f r (x) will be an oscillatory function, as shown in Fig. [T2"l 
Ultimately we are interested in how the shape function 
impacts the uncertainty in the extraction of | V u b | or that 
in determining the B — > X s j event fraction above a cer- 
tain photon energy cut. These depend on weighted inte- 
grals of F(k), so their uncertainties will be much smaller 
than the point-by-point errors in approximating F(x) by 
FW (x) shown in Fig. QJ] 



C. Subleading shape functions 

Any precision analysis of differential spectra in B — > 
X s ~f or B — > X u lv must incorporate power correc- 
tions that go beyond Eq. (g]). At order Aqcd/to&, 
six subleading shape functions enter the description of 
the most general inclusive spectra in B — * X s "f and 



B -> X u lv [H H3, IH HI]. We refer to these as the 
primary subleading shape functions. In addition there 
are terms in B — > X s j that enter from operators other 
than O7 and are related to the photon's hadronic struc- 
ture, whose contribution to the total rate is not calcula- 
ble in the OPE [5i, |4l| . Considering only the primary 
subleading shape functions and using suitably weighted 
integrals of the B — > X u lv differential rate together with 
B — > Xg'y, the (9(Aqcd/w&) shape functions can be can- 
celed in the determination of | V u b | |42j . However the same 
drawbacks apply to the weighting method at subleading 
order as those mentioned already in the introduction, so 
it is interesting to consider how our analysis can be ex- 
tended to include these subleading shape functions. 

The appropriate factorization theorem for the primary 
subleading shape functions, analogous to Eq. Q, is 
known from Ref. (39j . Thus, when one-loop partonic 
calculations of these functions and their corresponding 
jet functions are available, a construction analogous to 
Eq. (|21[) can be carried out to build in the proper large- 
u> tail, fJ-A dependence, etc. However, it should be cau- 
tioned that a large number of additional shape functions 
enter in the matching at 0(a s AQCD /mb) (Ha f43l |44|. so 
the utility of extending our full analysis to this level is 
unclear. 

Here, we simply discuss how a complete basis can be 
constructed for the primary subleading shape functions. 
We know less about these shape functions than about 
the leading order one. Their moments are still related 
to HQET matrix elements. In particular, the zeroth 
moment of the C(Aqcd /mb) shape functions vanishes, 
which means they must be negative for some values of 
k, and their first moments either vanish or are given by 
linear combinations of Ai and A2. The zeroth moments 
no longer vanish for the 0(AQ CD /ml) shape functions 
and beyond [13, |M S3 ■ 

For simplicity, we define the sign of each C(Aqcd /mb) 
shape function such that its first nonzero moment is pos- 
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itive. A convenient expansion for the functions with van- 
ishing first moment is given by 



(57) 



while the four-quark operator shape functions, whose ze- 
roth and first moments vanish, can be expanded as 



y^d n f n (x) 



(58) 



With this form, dfc (k) = / °° dfc ft if 4 « (fc) = 0. 
The first nonzero moments provide constraints on the 
coefficients, similar to Eq. (j4Tj) . for example 



(59) 



/ dfcfttf(fc) = A 2 Vd 2 , 
Jo 



which can be set to the appropriate linear combinations 
of Ai and A2. Note that a different set of coefficients, d n , 
occurs for each subleading shape function. 



V. EXTRACTING THE F FUNCTION AND 
PREDICTING DECAY RATES 

For a given basis for the leading and subleading shape 
functions we want to extract the basis coefficients, Cj and 
di, including their uncertainties and correlations. This 
extraction will use data on the B — > X s j and B — > X u tv 
spectra and data that determines moments of the shape 
functions from B — > X c £v. To simplify our discussion, 
we adopt a notation that is suitable for the coefficients 
{co, . . . , cat} of the function F(k) appearing in the lead- 
ing shape function. The generalization to incorporate 
subleading shape functions is straightforward. All results 
in this section are exact for N — > 00 and for finite N one 
has to take into account the truncation error discussed 
in Sec. HVBl 

Since the function F(k) enters the decay spectra lin- 
early, we can independently compute the contributions 
of the product of any two basis functions, f m {x)fn{x), 
in the expansion of F(k), which we denote by dr mn . 
The differential spectra dT s — dT s /dE 7 or dr u = 
dT u /dE^dp x dp x are then given by 



N 



E 



dr I( 



(60) 



where from combining Eq. © and (f3"9")l . the dT mn are 

dT mn = T H(p-)J* X dk P{P X ' k) 
' — k\ „ /pir — k^ 



For simplicity, we suppress the scale n in the arguments 
of H and P, and the u or s subscripts on H. The dT mn 
in Eq. (fBTj) act as a basis for the physically measurable 
distributions, and the result in Eq. I|60p is a quadratic 
polynomial in each of the fit parameters, Cj. 

In practice, experimental spectra are binned. Integrat- 
ing Eq. (|60p the rate in the i-th bin is 



N 

E < 

m,n— 



r' 



(62) 



where T l mn is the integral of dr m „ in Eq. ([6"T|) over the 
phase space region of the i-th bin. In addition, we can 
impose constraints on the moments of F(k), 



M j = / dfc k j F(k) = V 
Jo „ 



™n Ml- 



(63) 



Here, M° are given in terms of experimental measure- 
ments of 8 and Ai from B — ► X C £D, while M^ n is 



M 3 = 

mn 



(64) 



One can determine the coefficients {co, . . . , cat} by fit- 
ting Eqs. (f6"2"]l and (|r?5)) to the experimental data. It is 
straightforward to combine several different spectra and 
measurements by different experiments. Doing a simulta- 
neous fit to Eqs. (f6"2"| and (|6"3l a llows one to combine the 
B — > A s 7 shape information [y, Mj8| and information on 
the mx spectrum in B — > X u lv [9j with the information 
on rrib and matrix elements of local operators known from 
B -> X c lv [13, Ellil- Given the theoretical input, T l mn 
and M^ n , computed in this paper, one needs to simply 
fit quadratic polynomials in the fit parameters, c„, to the 
data. The experimental uncertainties and correlations in 
F l and M 3 and the theoretical uncertainties and correla- 
tions in T^yj and M^ ra will translate to uncertainties of 
the coefficients c n . In this approach, the uncertainty in 
the functional form of the F function is automatically and 
straightforwardly determined by the uncertainties of the 
coefficients {co, . . .,Cjv}j and the truncation error from 
approximating F(k) by F^ N \k) discussed in Sec. IIV 51 

Finally, in Fig.[T3]we combine our results to show the 
B -> X sl spectrum, (dr s /dS 7 )/ [r 0s |C^ ncl (0)| 2 ] , in the 
B meson restframe. The results are shown using the 
short distance parameters m\ and \\, and the nine 
shape function models plotted in Fig. [TU1 which have 
fixed zeroth, first, and second moments. The solid black 
curve is our default model, F mod (fc) in Eq. (31]). In the 
current and future experimental analyses ms — e 2.E 1 < 
2 GeV. Our formalism has the virtue that we do not 
need to distinguish regions 1) and 2) in Eq. (J3J, i.e., it 
is simultaneously valid for both tub — 2E^ ut ~ Aqcd 
and ^> Aqcd- The variation of the curves near maxi- 
mal E 7 indicates that in the peak region the first few 
moments of the shape function are not sufficient to pre- 
dict the spectrum. However, for E 1 < 2.1 GeV, the un- 
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E 1 [GeV] 

FIG. 13: The B — > X s ^f photon spectrum in the B rest frame 
using the nine shape function models shown in Fig. 1101 

certainty in the prediction becomes significantly smaller, 
and the leading order shape function model uncertainty 
diminishes. The spectrum in this region is also affected 
by subleading shape functions and additional perturba- 
tive corrections not studied here. (Note that the mea- 
sured spectrum is somewhat broadened by experimental 
effects.) We found the same conclusion using many other 
shape function models, constructed from different bases, 
and expect that this will also be substantiated by an ac- 
tual experimental analysis in which the space of shape 
functions is explored by allowing for more components in 
our orthonormal basis and a simultaneous fit to all rele- 
vant data. This will have important implications for the 
B — > X s j rate in the presence of an experimental cut, 
the extraction of \V u b\ from inclusive semileptonic B de- 
cays, and the analysis of the inclusive B —* X s l + £~ rate 
in the small q 2 region. 

VI. CONCLUSIONS 

In this paper, we introduced new methods for calcu- 
lations relevant for inclusive B decays which involve the 
parton distribution function of the b quark in a B me- 
son, called the shape function. On the theoretical side, 
our results allow for an improved description of the decay 
rates and a more reliable assessment of the uncertainties 
than earlier studies. On the experimental side, they are 
straightforward to implement and provide a transparent 
way to combine constraints on the shape and moments 
of the shape function, with controllable uncertainties. 

The shape function is constrained by the measure- 
ments of the shape of the B — > X s ^f photon energy spec- 
trum @, 0, Q and the mx spectrum in B — > XJLv @ , 
and its moments are related to the b quark mass, mj,, 
and nonpcrturbative matrix elements of local operators 
in the OPE, which are constrained by fits to B — > X c lv 
decay distributions [l(| EL E3] ■ 

The first key ingredient in our analysis is the new de- 



scription of the shape function, given in Eq. |J5J| and dis- 
cussed in detail in Sec. IIIH which is by construction con- 
sistent with the renormalization group evolution and the 
perturbative result for the tail of the shape function. It 
allows combining all experimental information from the 
shape of B — > X s ~/ and B — > X u lv spectra, and njj, and 
Ai constrained by B — > X c iv distributions. Any short 
distance scheme for the b quark mass and the kinetic en- 
ergy matrix element can be implemented. We presented 
a simple formula for the differential rates which incorpo- 
rates resummed perturbative corrections, with details of 
the derivation given in Apps. lAlandlBl 

The second key ingredient in our analysis is the expan- 
sion of the F function, describing the nonperturbative 
part of the shape function, in a complete set of orthonor- 
mal functions in Sec. IIVI This gives a new way to quan- 
tify uncertainties in the functional form of the shape func- 
tion, which was previously not explored fully systemati- 
cally. Choosing Y(x) in Eq. (|4"4")l to coincide with any of 
the models used in the literature [H, H [M |H, Ell gives 
an orthonormal basis for the shape function in which the 
first function is the model and corrections can be studied. 

One should use renormalon-free short distance defini- 
tions for input parameters, such as the b quark mass, mj, 
or the kinetic energy matrix element, Ai. We found that 
in our framework the kinetic scheme definition of A^ m 
seems to oversubtract from the HQET definition of Ai in 
dimensional regularization, numerically similar to using 
the MS mass for inclusive spectra. To solve this problem, 
in App. [C] we introduced a new short distance "invisi- 
ble" scheme, A^, which is renormalon-free and only dif- 
fers from the usual definition starting at order a 2 . While 
it does not improve the behavior of the perturbation se- 
ries decisively, being almost invisible, at least it does not 
make it worse. 

It should be emphasized that all developments pre- 
sented in this paper are consistent with and incorporate 
all predictions that follow from QCD, without recourse to 
models or relying on any ad hoc assumptions. In fact, any 
method consistent with the factorization theorem and the 
OPE discussed in Sec. Hi] can be cast in our framework. 

Our results lead to the following strategy to determine 
\V u b\ with optimal and reliable uncertainties: 

1. Fix a basis for the expansion of the F function by 
choosing a suitable Y{x) and a value for A. 

2. Do a combined fit to the binned B — > X s -f and/or 
B — > X u lv spectra, and to the information on rrib 
and Ai from B — > X c £u (and possibly higher mo- 
ments) to extract the basis coefficients {co, . . . , cjv}- 

3. Verify that the truncation error is small compared 
to other uncertainties. 

4. Use the values and covariance matrix of the ex- 
tracted basis coefficients to make predictions with 
reliable uncertainties. 

More details on the shape function fitting procedure, 
on extracting |Ki&| from various B — > X u iv differential 
decay distributions, and predictions for the B — > X s j 
rate will be presented elsewhere. 
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APPENDIX A: PERTURB ATIVE RESULTS 



In this appendix we collect the known results for the 
hard functions H UtS (p^, fii) in Eq. (j4|), and derive an an- 
alytic result for the function P{p~ ,p~x, Hi) in Eq. 0, 
which includes perturbative corrections from the jet and 
soft functions as well as the shape function RGE. In 
Ref. [32j it was shown in the context of dijet production 
that the convolution form for the soft function analogous 
to Eq. (f2"Tj) allows all factors associated with perturba- 
tive corrections and RGE to be evaluated analytically. 
Here we show that this is also true for inclusive B de- 
cays. Our calculation differs from Ref. [HJ in that we 
develop a method that avoids using an imaginary part 
at intermediate steps. (An alternative analytic method 
that avoids the plus distributions by using moments was 
developed in Ref. [471].) Furthermore, with our basis for 
F(k), we show that the integrals over the basis function 
can be written in terms of hypergeometric functions. 

We discuss fixed order results in Sec. IA 11 the RGE 
in Sec. IA 2\ and the rate in short distance schemes in 
Sec. IA3I The definitions of the required plus distribu- 
tions and many useful relations are collected in App. [B] 



depend on fj,b and is defined by the split matching proce- 
dure in Ref. [4|, which separates the perturbation scries 
above and below the scale mb. It can be written as L48| 



Cr'(O) = C 7 + F 7 (0) + G 7 (0) , 



(A2) 



with the central values C 7 ncl (0) = -0.341 - 0.015i and 
C7 = —0.261. The Wilson coefficient C7 and the func- 
tions F 7 (q 2 ) and G 7 (q 2 ) are defined to be separately /j, 
independent. Here, C7 contains the dependence on the 
MS b quark mass and the Wilson coefficient CV(/z) from 
the operator O7 in the weak Hamiltonian, 



C 7 = C 7 ^) 1 ^-+C i (n)K i7 



mb 



4rr 



3 VTLb 



32 
~9~ 



c 8 (h) 



+ W(7SV + 7lr 0) ) 



+ 0(a 2 s ). 



(A3) 



The Wilson coefficients Ci(/x), 



and the anomalous 



dimensions 7>° are with respect to the operator basis 
of Ref. and are given explicitly in Ref. The 
function Fj(q 2 ) contains perturbative corrections from 
other operators in the weak Hamiltonian, while Gj(q 2 ) 
contains nonperturbative corrections from intermediate 
cc states. They are given in Eqs. (A5) and (A12) in 
Ref. 

The coefficient h s (mb,Hb) in Eq. (|Aip corresponds to 
the matching coefficient for the tensor current in SCET. 
At 0(a s ) it was computed in Ref. [5(J. To determine the 
hard matching coefficient at 0(a 2 ) we take the two-loop 
computation of the |CV| 2 terms in the b — ► S7 rate and 
spectrum from Refs. |5l| and subtract the terms in the 
partonic two-loop spectrum in SCET coming from the jet 
and shape functions (given by P(m,b,k, fib) in Eq. (|A30[) 
below). Due to the split matching, the /ib dependence 
in h a (rrib, Hb) cancels entirely against the Hb dependence 
in Uff(Tnb, fib, Hi) m Eq. (|AL|) . and does not depend on 
the fx dependence of the coefficients in the electroweak 
Hamiltonian. We obtain 



1. Fixed order results 

In Eqs. (HI) and © the leading order hard function 
H 3 (pj c ,Hi) for B —> X s j is 

H s (p+,tH) = {mB ~t x)3 icno)i a 

b 

xh s (m b ,iJ,b)U H {mb,Hb,Hi), (Al) 

where the evolution factor Unimb, Atf>, Hi) is given be- 
low in Eq. (|A14j) . and has the boundary condition 
Unijnb, Hb, Hb) = L The Wilson coefficient C 7 ncl in H s 
contains the weak scale matching of the full theory onto 
the effective Hamiltonian from which the W and t are 
integrated out, and resums perturbative corrections be- 
tween the weak scale and the scale \i ~ mt>. It does not 



h s (m b ,Hb) 

7T V nib 2 nib 



IT 

3+ 24 



ir z 12 nib 



5 r 
2 Cf 



6 / nib 



+ 



49 tH\ /tt 2 1\ 25 

T + 2l)^ + (l2-3)^-24^ 

/117 177T 2 \ /rr 2 29 

lr6" + ^" 3C3 J Cf+ lT2"72 



ln 2 ^L 



29 , llCa 



C A 



0o 



/341 TT 2 
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48 
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3563 29tt 2 



A- 
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(A4) 
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Our expressions here agree with the corresponding nu- 
merical results in Eq. (99) of Ref. (52j. Note that the 
inverse powers of mj in Eqs. (|Al|) . (|A3j) . and (|A5|) below 



cancel the m\ factors in Tq s .u in Eq. ([5]), and any mass 
scheme can be used. The mj appearing in h s (mb, fib) and 
Unimb, fib, Hi) is defined in the pole scheme. Here, the 
pole mass renormalon ambiguity is less relevant, because 
mb only appears as a reference scale for fib, so the effect 
of changing nib is included in the variation of /if,. For 
numerical calculations, we use mj, = 4.7 GeV. 

The hard function H u (E£,p x ,p x > fii) for B — > X U £D 
in Eqs. (fj]) and ^ is given by 



Hu{Ee,p x ,Px,fJ-z) 



24 



(m B - Px) {2E e + p x - m B ) 



To all orders in perturbation theory the jet function 
and shape function kernel can be written as 

J(p 2 ,p l ) = — ^ Jn[a s (Hi)} £n(p 2 /rf) , 
n=-l 



1 °° 

Co(^>Ma) = — S n [a s (fiA)} C n (ui/nA) , 

MA 

n— — 1 
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where the C n (x) are plus distributions defined in Eq. (jB6|) 
for n > and in Eq. (|B10|) for ra = —1, and the coeffi- 
cients J n (a s ) and S n (a s ) have expansions in a s . The 
jet function coefficients are known at one [2l[ and two 
loops [53|], and are given by 
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where the h U i(nib,p , Hb) correspond to the matching 
coefficients of the V — A current in SCET. To order 
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m b — p \mb-p 



The corresponding two-loop results for H u will be easy 
to implement once they become available. Finally, the 
evolution factor Uh(p" ,Hb, Hi) m Eq. (|A1|1 is identical 
for H s and H u , and is given below in Eq. (j A14|) . 

We now turn to the function P{p~~ ,k, Hi) in Eq. |(7J). 
Changing variables w — * a/ + fc — w, we have 



P(p ,k,fM) = / duU s (k -u,Hi,Hh) (A7) 
x / du/p"J[p"(w - a/), Mi] C q (u',ha) . 



To incorporate the fixed order a s corrections to the 
jet function, J(p 2 ,Hi), and the shape function kernel, 
Cq(uj, fi\), we will first carry out the u' convolution in- 
tegral. The evolution factor Us(px' Mi) Ma); which sums 
logarithms between the scales Hi and ha, and the integral 
over w are discussed later. 
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The shape function coefficients to one [2l| and two 
loops [54j are 
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Eq. (|A7|) . The factor Uh(p~ , Hb, Hi) which describes the 
evolution of the hard functions in Eqs. (|Aip and (IA5|) 
between the hard scale, Mb) and the jet scale, /i,, is [50| 



All other coefficients in J and Cq start at higher orders 
in a s . 

To convolute £„[p~(w — oj')/h 2 ] with C m (u)/fj,\) we 
first rescale them to have the same dimensionless argu- 
ments. Using Eq. (|B13[) . J and Co satisfy the rescaling 
identities 



(A14) 



J(jp w,m) = — 17 E Jn a s (/i),^ 



n=-l 

OO 



where definitions of 77(^4, Mb) and Kui^i, Mb) are given in 
Eq. (|A17j) below. The soft evolution factor Us(u), Hi, //a) 
in Eq. (|A7[) sums logarithms between the jet scale, Mi> 
and soft scale, /ia- To all orders in perturbation theory 
it can be written as 
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C Q (w^) = - E S« ^]A»(|). (All) L/ s ( w , M . i , MA ) = e Ks ^^) 



where £ is an arbitrary dimension-one parameter that we 
will choose at our convenience later on, and the rescaled 
coefficients are 
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Using Eq. (jATTjl . the convolution of J and C in Eq. ((AT)) 

becomes 

(J <8> C Q ){u),HuHh) 

da/p~J[p _ (tj - a/), Mi] Co(w',(j,a) 
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In the last step we used Eq. (|B16[) to perform the x in- 
tegral, yielding a sum of plus distributions, whose coeffi- 
cients V e mn are given in Eq. (|B17[) . 

2. Renormalization group evolution 

Next, we summarize results for the renormalization 
group evolution, and then carry out the u> integral in 
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where r\ = ^(Mi,Ma)) an d we defined 



W r(i + r,) 



(Ai6) 



To go from the first to the second line in Eq. (|A15[) 
we used Eqs. (|B8|) and (|B12[) . An expression for 
Us{uJ, Hi, Ma) was first found in Ref. [HI and extended 
to all orders in Ref. [13]. The form with the plus dis- 
tribution derived in Ref. [32| makes the formula valid 
without having to require jii > MA- 

In Eqs. (|A14p to (|A16[) . r)(p, Mo) and K x (fj,, Mo) for x — 
H or S are given to all orders by 
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Here, r cusp is the universal cusp anomalous dimension. 
The only difference between Kh and K$ are the hard and 
soft anomalous dimensions, "f x = 75 or jjj- Expanding 
the /3 function and anomalous dimensions as usual, 
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with r = a s (fj,)/a s (no). When using ti{h, Ho) and K x (h> Ho) at LL, NLL, and NNLL according to our conventions in 
Eq. (f2"5|) , we do not re-expand the results in Eqs. (|A14[) and (|A16|) . but keep their full expressions everywhere. For 
the running coupling we always use the three-loop expression 
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where X = 1 + a s {no)Po l n (/VMo)/(27r), and we evolve to 
lower scales using the reference value a s (Mo = 4.7 GeV) = 
0.2155 as in Table|T]with n/ = 4. Up to three-loop order, 
the coefficients of the P function in the MS scheme are 

Po = Y Ca ~ s n f > 
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The cusp [55j, [56J , soft [57J, 158J , and hard anomalous di- 
mension coefficients are 
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To determine 7^ we used 7^ = —7/ — jf , which follows 
from the m independence of dr s /di? 7 . We use the two- 
loop computation of 7/ in Ref . [531 ] . 

The evolution of the shape function kernel Co(w,/x) 
from ha to /ij can be written as the sum of a finite number 
of terms, 

C (u), m) = / dui' U s (lu - u', Ha) C (w', ma) 
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In the last step, we used Eq. (|B16[) to perform the x inte- 
gral. The coefficients of the resulting plus distributions, 
Vp{rj), are defined in Eq. (|B17[) . If Cq(uj,h) is known to 
0(a k s ) accuracy, then the terms contributing in Eq. (|A23[> 
are bounded by n < 2k — 1. To obtain the shape func- 
tion evolved up to Hii S(uj, Hi)i we combine Eq. (|2ip with 
(TA23| taking for convenience £ = u> and changing the in- 
tegration variable to z via k = u>(l — z). This gives the 
result for S(w,Hi) that is quoted in Eq. (124(1 in the text. 

Returning to P(p~ ,k, Hi) m Eq. (|A7|) , to perform the u> 
integral we can now apply the same steps as in Eq. (|A23[) 
to compute the convolution of t/s(w, Hi? Ha) with the re- 
sult of Eq. (fAT3|) . This yields 
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where r\ = r)(jj,i, fi&), and the coefficients are 
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with y/(?7) and V e mn given in Eqs. (lB17)l and (fBlS)) . The 
jet and shape function coefficients, J m and S n , have per- 
turbative expansions in a s (fii) and ^(/ja), respectively, 
given by Eqs. (|A9j) and (|A10|) together with Eq. (jAffjl . 
To 0(a s ) we have m,n < 1, so dropping cross terms of 
0( a s) and higher, we need — f < I < 1 and — 1 < j < 2. 
To 0(a 2 s ), we have m,n < 3, and we need cross terms 
between J m and S n up to — 1 < ^ < 3 and — 1 < j < 4. 

The final step now is to compute the convolution of 
P(j>~ , k, with some function F(k), 
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(A26) 
-z)]. 



In both Eqs. (|24[) and (|A26|) the remaining integral that 
needs to be performed is C^(z) with F[ui(l — z)]. For 
j > it is convenient to simplify this integral to 



dz C](z) F[u(l - z)\ 



= / dz 



— {F[c(l-z)]-FH} 
dt ln J ' t {F[w(l - i 1/r ')] - F(w)} 



7? 

where the second line is valid for ?; > —1, but the last 
line only for r\ > 0. For j = — 1 the integral is trivial 
since C r !_ 1 (z) = S(z). Our basis for F(k) involves a series 
of terms of the form 



f(fc) = ^a,Q) r exp(- s |), 



(A28) 



where r and s are integers. For each term in the series 
the integral in Eq. (|A27|) can be performed analytically 
using 
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By taking derivatives of this result with respect to a 
one can obtain the integral with (In z) J on the left-hand 
side. In practice, we find that the numerical integration 



of Eq. (|A27P is sufficiently fast that using Eq. (|A29[) is 
unnecessary. 

Finally, we comment on a special case of the above 
results that was used to carry out the B — > X s ~/ matching 
computation to derive Eq. (|A4[) . Here we used the SCET 
computation of the decay rate with fj, b — /ii — fi\ = 
m b and rj — 0, which depends on P(mb,p^-, nib)- For 
this special case, the choice £ = m b is convenient and 
Eqs. (|A24p and (|A25[) then reduce to 

OO _|_ 
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Thus Pj (m b , m b , m b , m b ) is the coefficient of the plus dis- 
tribution or S distribution Cj in the fixed-order compu- 
tation evaluated at ji = m b . 



3. Changing to short distance schemes 

To change the scheme from the pole scheme to a short 
distance scheme, we define a perturbative function P as 
in Eq. ((7]), but with Co replaced by Co, and in the dif- 
ferential spectrum P and F are replaced by P and F. 
Displaying only the integration variables, 
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with 



P(k) = JdujUs(k-Lu) Jduj' J(w-o/)C (a/) (A32) 
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and <5mb and <5Ai defined in Eq. (|28|) . This result has 
exactly the same form as Eq. (p3|) . which relates Co(w) 
and Co(w), so the analysis we carry out below also holds 
for determining P(k) and Cq(u>). To be definite, we use 
P(k) below. The results for Co(w) are obtained by sub- 
stituting P(k) — > Co(w). 

To ensure the proper cancellation of renormalon am- 
biguities, the perturbative series in 5m b , 6\±, and P{k) 
have to be re-expanded to the desired order. Denoting 
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where the dummy variable e = 1 counts the order in 
the perturbative expansion. As indicated, the scale for 
a s in 5mb and 5\± must be set to fiA to ensure that 
renormalons cancel. To illustrate this, the zxa varia- 
tion in Fig. [3] yields the numbers 0.74, 0.76, 0.72 GeV" 1 
for S(u — 0.5 GeV,//) at NNLL. However, holding /xa 
in Srrib fixed at the central value the result becomes 
0.81, 0.76, 0.68 GeV^ 1 , with a much larger scale depen- 
dence. 

For P(k), we count both a s (fj,i) ~ e and a s (fjt\) ~ e in 
5„ and J m , i.e., we expand the cross terms between J m 
and S n . Note that we do not expand the cross terms in 
the product of H q and P in Eq. (jA3Tj) . Then, to C(e 2 ) 

P(fc) = e° p(°> (fc) + e 1 P« (fc) + e 2 P< 2 > (k) (A34) 
d 5Af 5 d 2 " 



X (!) " 
^ dk 



X ( 2 ) d 
^ dfc 



6 dfc 2 



r ^/'J/'" — — — eP (0) (fc) + e 2 pW(fc) 

and integrating by parts we can move the derivatives to 
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For nib, we mostly use the IS scheme [30], in which 

(A36) 
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By default we take P = 1 GeV. 

We also list the corresponding expressions in the ki- 
netic scheme [3l|, defining A^ ln = — fi 2 and using R for 



the momentum cutoff, 
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By default in the kinetic scheme P = 1 GeV. 



APPENDIX B: PLUS DISTRIBUTIONS AND 
CONVOLUTIONS 

We define a general plus distribution for some function 
q(x), which is less singular than 1/x 2 as x — > 0, as 



[q(x)] [ ? ] =[6(x)q(x)][ 



(Bl) 
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Since Q(xq,xq) — 0, the point xq should be thought of 
as a boundary condition for the plus distribution. Inte- 
grating against a test function fix), we have 



dx[6(x)q(x)]X o] f(x) (B3) 
""da: q(x) [f(x) - f(0)} + f(0) g(x max , xo) . 



Taking f(x) = 1 in Eq. (|B3p one sees that the integral of 
the plus distribution vanishes only when integrated over 
a range with x max = xq. Plus distributions with different 
boundary conditions are related to each other by 

[8(x) q(x)] J o] = [6(x) q(x)] J l] + S(x) Q(x 1 , x ) . (B4) 

We will almost exclusively use the boundary condition 
xq = 1, and will drop the superscript [xq] on the plus 
distributions when this default choice is used. 

Taking the special case q(x) — l/x 1 ~ a with a > — 1, 
we define 
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With our boundary condition, C a (x) for a = reduces to 
the standard definition of [9(x)/x} + . For q(x) = ln n x/x 
with integer n > we define 



Finally, we define the "mixed" distribution 
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Since both £ a (x) and C n {x) are defined with the same which satisfies C^{x) = C n (x) and Cq(x) = C a {x). For 
boundary condition xq = 1, they are related by convenience we also define 

d" 
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C-i(x) = C a _ x {x) = 8{x) 



(BIO) 



This makes it easy to derive identities involving C n {x) 

from identities involving L a {x) by taking derivatives with The following identities are useful 
respect to a. 

The definitions in Eqs. (jB5| and (|B6|) can be contrasted 
with those in Ref . [32j , where the same boundary condi- 
tion xq = 1 is used for [9(x) In" x/x] + = C n {x), but 
xq = oo and xq = are used for [0(x)/x 1 ^ a ]+ with a < 
and a > 0, respectively. This is the form appearing in 
the soft evolution factor, Eq. (|A16|) . where it comes mul- 
tiplied by a factor of a, so the limit for a — > exists. 
Using Eq. (|B4|1 to convert to our definitions, we have 
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The satisfies the rescaling identity (for A > 0) 
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from which we can obtain the rescaling identity for C n {x) 1 
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This agrees with the result in Eq. (C3) of Ref. [32l| . We will also need convolutions of two plus distributions, 

V(a,b) + 
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In the second step we used the definition in Eq. (|F35[) . Here V(a, b) is defined by 

r(a)r(6) 1 1 



V(a,b) 



F(a + b) 



(B14) 



(B15) 



which satisfies V(0, 0) = 0. Taking derivatives with respect to a and b we can get the corresponding formulas for 
convolutions involving C n , 



dyC a {x-y) C n {y) 



d™ 
do™ 



C a+b (x) + 



5(x) 



V(a,b) 



C a n+1 (x) C-(x)-C n (x) 



fc=0 



71+1 



n+1 



1 1 



k=-l 
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/ 



dy C m {x - y) C n (y) 



d rn d r, 



5(x) 



da m db n 

m+n+1 



V(a,b) 



=6=0 



77i + l n + 1 



[x) 



(B16) 



fe=-i 



The result in the second line of Eq. (|B16[) reproduces a result given in Eq. (B6) of Ref. [59(. The coefficients V£(a) 
and V k mn are related to the Taylor series expansion of V(a,b) around a = and a — b — 0. The nonzero terms for 
n > are 



d™ V(a,b) 



db n 



b=0 



mo) = 



\ d n ~ k 



+ <>fcn 



b=0 



n+ 1 



fc = -1, 

< fc < n, 
k = n + 1 . 



(B17) 



The term 6k n in V fc ™(a) and the last coefficient V^ +1 (a) arise from the boundary terms in the convolution integral. 
The V fc mrl are symmetric in m and n, and the nonzero terms for to, n > are 



d™ 7(a,6) 



da m db n a + b 



=0 q=0 

1 1 



a=6=0 



jm-p r|n-g 



p J \qj da m 'P db n ~<i 



a=b=0 



, TO +1 71 + 1 



< k < to + n . 



k = m + n + 1 



(B18) 



The last coefficient T^ I + l „ +1 again contains the boundary term. Using Eq. (|B10|) we can extend the results in Eq. (|B16j) 
to include the cases n = — 1 or m = - 1. The relevant coefficients are 



V-l («) = 1 , V -\a) = a , 7^ (a) = , 



T r— l,n irll.- 1 r 

Vu = Vu = 6, 



k ■ 



(B19) 



APPENDIX C: THE INVISIBLE SCHEME FOR Ai 

In this appendix we define a new scheme for Ai, which 
we call the "invisible scheme". It is a short distance 
scheme, free of the u = 1 renormalon ambiguity, and 
it only deviates from the standard HQET definition of 
Ai at 0(a 2 s ). Since the renormalon in Ai depends on 
the regularization scheme and in particular its symme- 
tries [13, [6lj] 5 it is desirable to define Ai using a scheme 
which has the same symmetries as the multiloop dimen- 
sional regularization calculations of its coefficient func- 
tion. 

For a general ultraviolet (UV) regulator Auv, the bare 
kinetic energy operator is 

\b v (iD±) 2 b v ] baro = Z ± b v b v + Z kin b^iD^) 2 b v + . . . , 

(CI) 

where Z\ oc A^ v and the ellipses denote higher dimen- 
sion operators whose coefficients vanish as Auv ~ * 00 • 
Usually the kinetic energy matrix element in HQET is 
defined by 

Ai = {B\b v {iD ±y b v \B) , (C2) 



where UV divergences are regulated in dimensional reg- 
ularization (we follow our convention of using full B 
states even if this is not always the practice in HQET). 
In that case, power divergences do not appear, so in 
Eq. (|Cip Z\ = 0. Furthermore, since this scheme respects 
reparametrization invariance [62j, Z^ n — 1 and Ai is fi 
independent. Nonpcrturbatively Ai can still be sensitive 
to the quadratic UV divergence of Z\ through a« = l 
renormalon. The presence of this renormalon implies 
that there is an C(Aq CD ) ambiguity in the definition in 
Eq. (|C2p . In observables like a decay rate this Ai ambigu- 
ity cancels against a corresponding infrared renormalon 
ambiguity in the large-order behavior of the perturbation 
series in the leading order Wilson coefficients |63j |. 

We use the notation Ai for a generic short distance 
definition, which does not suffer from the renormalon 
ambiguity. Any Ai can be related to Ai by a pertur- 
bative series <5Ai ~ a s + a 2 + . . ., where Ai = Ai — SXi. 
For the kinetic scheme the u = 1 renormalon ambigu- 
ity is avoided by defining using the second moment 
of a time-ordered product of currents with an explicit 
hard cutoff regulator /if [64]]. Here Z\ and <5Ai oc /i 2 a s . 
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FIG. 14: Effect of the short distance subtractions 5Ai m for 
the kinetic scheme at NLL (dashed) and NNLL (solid) on 
S(u, 2.5 GeV) for /m = 1.0 GeV. The blue (dark) lines corre- 
spond to using mjf n together with A^ ' 6 , while orange (light) 
lines use both mf n and Ai™. 



a divergent series with terms ~ n!o£ + . However, the 
leading renormalon series ~ n\ /3q is absent, hence 
the renormalon ambiguity at lowest order in Ai is "in- 
visible" [6(J. Correspondingly, at one- loop order Z\ = 
for Lorentz invariant regulators (even for a hard cutoff), 
and generically Z\ oc CaQ. 2 |6l| . In the kinetic scheme 
the regulator is not Lorentz invariant, and hence not sup- 
pressed by "invisibility" . The same holds for definitions 
of Ai involving a lattice spacing regulator [60| . 

To avoid oversubtractions from Ai at low orders in per- 
turbation theory, we would like to define a short distance 
scheme with 5\\ ~ oq. This can be achieved by finding a 
scheme that is consistent with the suppression indicated 
by the invisible renormalon. To construct an "invisible 
scheme" for Ai we define 

X\(R) = X 1 -5X\(R), (C3) 

where the series in a s is obtained by evaluating the ma- 
trix element 



Since the kinetic energy operator mixes into b v b v , it mod- 
ifies the perturbation series multiplying b v b v , and it is 
believed that this removes the corresponding u = 1 in- 
frared renormalon in the Wilson coefficients. In Ref. [l4[ 
a "shape function" scheme for Ai was introduced based 
on the second moment of S(u>, /i) with a cutoff, which also 
has SXi oc li 2 f(x s . A potential problem with these schemes 
is that they resolve an issue with the large-order behavior 
of perturbation theory by introducing a series that starts 
with a term 5X\ ~ a s . While the inclusion of an 0(a s ) 
term is known to provide good numerical stability when 
removing the u = 1/2 renormalon from the pole mass, far 
less numerical analysis has been done on the low-order 
impact of the u = 1 renormalon in Ai. If the low-order 
series is not yet influenced by the u = 1 renormalon, then 
schemes with an 0(a s ) correction may oversubtract, and 
not improve the perturbation series. In Fig. [T3]we show 
that in our shape function analysis there is evidence that 
this is the case. 2 The dashed and solid curves are the 
NLL and NNLL results, respectively. The curves show 
S(oj,iii = 2.5 GeV) with the kinetic mass scheme, but 
use either A^ ln (blue curves) or Ai from Eq. (|C2[) (orange 
curves). In the A^ m scheme the oscillatory behavior near 
the origin at both NLL and NNLL order is indicative of 
an oversubtraction. 

Some understanding of the weakness of the u = 1 
renormalon can be obtained from analytic computations. 
For Lorentz invariant re gula tors the u = 1 renormalon 
for Ai is "invisible" (60l. l6l|. namely the ambiguity for 
Ai is smaller than dimensional analysis indicates. In 
perturbation theory renormalon ambiguities appear as 



8X\{R) = {b v \b v (W ± fb v \b v ) 



2 This observation relies on our use of F\ (u) = F(uj) in Eq. 11331 to 
determine SCo (<j). It would be interesting to explore if a different 
choice would change the conclusions drawn from Fig. 1141 



(C4) 



Here, R is a Lorentz invariant hard cutoff UV regulator, 
ensuring that 5X\(R) ~ R 2 a 2 s . The definition in Eq. ([C3]) 
states that the invisible scheme X\(R) is the kinetic en- 
ergy of the b quark in the B meson minus the free kinetic 
energy of the b quark. Since the 0(Aq CD ) ambiguity in 
Ai is a UV effect caused by the mixing of the kinetic oper- 
ator into b v b v , it is the same for the B meson and b quark 
states, and cancels out in the difference. This can be seen 
explicitly by using Eq. (jCip and noting that our states 
are normalized so that (B v \b v b v \B v ) — (b v \b v b v \b v ) = 1. 
The C(Aq CD ) ambiguity is universal to the definition of 
Ai and independent of R. Although a precise definition 
of R is needed to define the scheme, the u = 1 renormalon 
ambiguity cancels out in Ai — SX\ for any such regulator. 
We adopt a definition that allows us to use the compu- 
tation in Ref. [6l{ . where it was shown that Z\ ^ at 
0(a 2 ) in a Lorentz invariant cutoff scheme. We define 



8X\{R) = lim A„„/(0,0) 



R 



(C5) 



lim 



R rR dwdw' A , 

— —t. r Disc Disc A vv > (w,w ) , 

i \^ in in' in J 



o (27ri) 2 w w' w 



w 



where 



A vv i (v ■ k, v ■ k') 



3(Mfc')|fti^X igG^b v \b v {k)) 

(v ■ v') 2 - 1 



and as usual the discontinuity of a function is given by 
DisCu, f(w) = lim e ^ [f(w + ie) — f(w — ic)] . Note 
that it is sufficient to consider the b v 'ig G^ v b v opera- 
tor due to the virial theorem in HQET, which relates 
lirrv^,, A vv i (0, 0) to the corresponding matrix element 
of the kinetic energy operator in Eq. (|C4[) . The result for 
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the cutoff matrix element from Ref. [6l| implies 

5X\{R)=R*C F C A (^-l)^ 

= 0.232 R 2 a 2 s (n) . (C7) 
Equation (|C7[) gives the relation of the invisible X\ to the 



HQET Ai in Eq. (|C2)) , which was used in the text. We 
use R — 1 GeV as our default value. Equations (|C3[) and 
(|C5j) provide a /i independent definition for X\ (R) , so the 
/i dependence in ar s (n) will cancel against a higher order 
R 2 a 3 s (fi) \n(n/R) term in 5X\(R). 



[1] M. Neubert, Phys. Rev. D 49 (1994) 3392 [hep- 

ph/9311325]; ibid. 4623 [hep-ph/9312311]. 
[2] I. I. Y. Bigi, M. A. Shifman, N. G. Uraltsev and 

A. I. Vainshtein, Int. J. Mod. Phys. A 9 (1994) 2467 

[hep-ph/9312359]. 
[3] K. S. M. Lee, Z. Ligeti, I. W. Stewart and F. J. Tack- 

mann, Phys. Rev. D 74, 011501(R) (2006) [hep- 

ph/0512191]; 

[4] K. S. M. Lee and I. W. Stewart, Phys. Rev. D 74, 014005 

(2006) [hep-ph/0511334]. 
[5] M. Misiak et al, Phys. Rev. Lett. 98, 022002 (2007) [hep- 

ph/0609232]; and references therein. 
[6] S. Chen et al. [CLEO Collaboration], Phys. Rev. Lett. 

87, 251807 (2001) [hep-ex/0108032]. 
[7] K. Abe et al. [Belle Collaboration], arXiv:0804.1580; 

P. Koppenburg et al. [Belle Collaboration], Phys. Rev. 

Lett. 93, 061803 (2004) [hep-ex/0403004]. 
[8] B. Aubert et al. [BABAR Collaboration], Phys. Rev. D 

77, 051103(R) (2008) [arXiv:0711.4889]; Phys. Rev. Lett. 

97, 171803 (2006) [hep-ex/0607071]; Phys. Rev. D 72, 

052004 (2005) [hep-ex/0508004]. 
[9] B. Aubert et al. [BABAR Collaboration], hep- 

ex/0408068; K. Tackmann [BABAR Collaboration], 

arXiv:0801.2985. 
[10] C. W. Bauer, Z. Ligeti, M. Luke, A. V. Manohar 

and M. Trott, Phys. Rev. D 70, 094017 (2004) [hep- 

ph/0408002]; C. W. Bauer, Z. Ligeti, M. Luke and 

A. V. Manohar, Phys. Rev. D 67, 054012 (2003) [hep- 

ph/0210027]. 

[11] O. Buchmiiller and H. Flacher, Phys. Rev. D 73, 073008 

(2006) [hep-ph/0507253]. 
[12] E. Barberio et al. [Heavy Flavor Averag- 
ing Group], arXiv:0704.3575; and updates at 

http : //www. slac . Stanford. edu/xorg/hfag/ 
[13] C. Balzereit, T. Mannel and W. Kilian, Phys. Rev. D 58, 

114029 (1998) [hep-ph/9805297]. 
[14] S. W. Bosch, B. O. Lange, M. Neubert and G. Paz, Nucl. 

Phys. B 699, 335 (2004) [hep-ph/0402094]. 
[15] A. K. Leibovich, I. Low and I. Z. Rothstein, Phys. Rev. 

D 61, 053006 (2000) [hep-ph/9909404]. 
[16] A. K. Leibovich, I. Low and I. Z. Rothstein, Phys. Lett. 

B 486, 86 (2000) [hep-ph/0005124]. 
[17] A. H. Hoang, Z. Ligeti and M. Luke, Phys. Rev. D 71, 

093007 (2005) [hep-ph/0502134]. 
[18] B. O. Lange, JHEP 0601, 104 (2006) [hep-ph/0511098]. 
[19] C. W. Bauer, Z. Ligeti and M. E. Luke, Phys. Lett. B 479 

(2000) 395 [hep-ph/0002161]; Phys. Rev. D 64 (2001) 

113004 [hep-ph/0107074]. 
[20] A. H. Hoang and I. W. Stewart, Phys. Lett. B 660, 483 

(2008) [arXiv:0709.3519]. 
[21] C. W. Bauer and A. V. Manohar, Phys. Rev. D 70, 

034024 (2004) [hep-ph/0312109]. 



[22] M. Neubert, Eur. Phys. J. C 40, 165 (2005) [hep- 
ph/0408179]. 

[23] T. Becher and M. Neubert, Phys. Rev. Lett. 98, 022003 

(2007) [hep-ph/0610067]. 
[24] G. P. Korchemsky and G. Sterman, Phys. Lett. B 340, 

96 (1994) [hep-ph/9407344]; 
[25] C. W. Bauer, D. Pirjol and I. W. Stewart, Phys. Rev. D 

65, 054022 (2002) [hep-ph/0109045]. 
[26] T. Mannel and F. J. Tackmann, Phys. Rev. D 71, 034017 

(2005) [hep-ph/0408273]. 
[27] F. J. Tackmann, Phys. Rev. D 72, 034036 (2005) [hep- 

ph/0503095]. 

[28] B. O. Lange, M. Neubert and G. Paz, Phys. Rev. D 72, 

073006 (2005) [hep-ph/0504071]. 
[29] W. M. Yao et al. [Particle Data Group], J. Phys. G 33, 

1 (2006), and 2007 partial update for the 2008 edition. 
[30] A. H. Hoang, Z. Ligeti and A. V. Manohar, Phys. 

Rev. Lett. 82 (1999) 277 [hep-ph/9809423]; Phys. Rev. 

D 59 (1999) 074017 [hep-ph/9811239]; A. H. Hoang 

and T. Teubner, Phys. Rev. D 60 (1999) 114027 [hep- 

ph/9904468]. 

[31] A. Czarnecki, K. Melnikov and N. Uraltsev, Phys. Rev. 
Lett. 80, 3189 (1998) [hep-ph/9708372]; Phys. Rev. D 
57, 1769 (1998) [hep-ph/9706311]; the a 2 3 f3 part of what 
later became Ai was first obtained in: A. Kapustin, 
Z. Ligeti, M. B. Wise and B. Grinstein, Phys. Lett. B 
375, 327 (1996) [hep-ph/9602262]. 

[32] S. Fleming, A. H. Hoang, S. Mantry and I. W. Stewart, 
Phys. Rev. D 77, 114003 (2008) [arXiv:0711.2079]. 

[33] F. De Fazio and M. Neubert, JHEP 9906, 017 (1999) 
[hep-ph/9905351]. 

[34] P. Gambino, P. Giordano, G. Ossola and N. Uraltsev, 
JHEP 0710, 058 (2007) [arXiv:0707.2493]. 

[35] M. Abramowitz and I. A. Stegun, "Handbook of Math- 
ematical Functions", Dover, New York (1964); for other 
proposed expansions, see, e.g.: A. H. Hoang and R. Hof- 
mann, Phys. Rev. D 67, 054024 (2003) [hep-ph/0206201]. 

[36] C. W. Bauer, M. E. Luke and T. Mannel, Phys. Rev. D 
68, 094001 (2003) [hep-ph/0102089]. 

[37] A. K. Leibovich, Z. Ligeti and M. B. Wise, Phys. Lett. 
B 539, 242 (2002) [hep-ph/0205148]. 

[38] C. W. Bauer, M. Luke and T. Mannel, Phys. Lett. B 
543, 261 (2002) [hep-ph/0205150]. 

[39] K. S. M. Lee and I. W. Stewart, Nucl. Phys. B 721, 325 
(2005) [hep-ph/0409045]. 

[40] A. Kapustin, Z. Ligeti and H. D. Politzer, Phys. Lett. B 
357, 653 (1995) [hep-ph/9507248]. 

[41] S. J. Lee, M. Neubert and G. Paz, Phys. Rev. D 75, 
114005 (2007) [hep-ph/0609224]. 

[42] K. S. M. Lee, Phys. Rev. D 78, 013002 (2008) 
[arXiv:0802.0873]. 

[43] M. Beneke, F. Campanario, T. Mannel and B. D. Pecjak, 



26 



JHEP 0506, 071 (2005) [hep-ph/04 11395]. 
[44] M. Trott and A. R. Williamson, Phys. Rev. D 74, 034011 

(2006) [hep-ph/0510203]. 
[45] J. R. Andersen and E. Gardi, JHEP 0601, 097 (2006) 

[hep-ph/0509360]. 
[46] U. Aglietti, F. Di Lodovico, G. Ferrera and G. Ricciardi, 

arXiv:0711.0860. 
[47] M. Neubert, Phys. Rev. D 72, 074025 (2005) [hep- 

ph/0506245]. 

[48] K. S. M. Lee, Z. Ligeti, I. W. Stewart and F. J. Tack- 
mann, Phys. Rev. D 75, 034016 (2007) [hep-ph/0612156]; 

[49] K. G. Chetyrkin, M. Misiak and M. Miinz, Phys. Lett. B 
400, 206 (1997) [Erratum-ibid. B 425, 414 (1998)] [hep- 
ph/9612313]. 

[50] C. W. Bauer, S. Fleming, D. Pirjol and I. W. Stewart, 
Phys. Rev. D 63, 114020 (2001) [hep-ph/0011336]. 

[51] I. R. Blokland, A. Czarnecki, M. Misiak, M. Slusarczyk 
and F. Tkachov, Phys. Rev. D 72, 033014 (2005) [hep- 
ph/0506055]; K. Melnikov and A. Mitov, Phys. Lett. B 
620, 69 (2005) [hep-ph/0505097]. 

[52] A. Ali, B. D. Pecjak and C. Greub, Eur. Phys. J. C 55, 
577 (2008) [arXiv:0709.4422]. 

[53] T. Becher and M. Neubert, Phys. Lett. B 637, 251 (2006) 
[hep-ph/0603140]. 

[54] T. Becher and M. Neubert, Phys. Lett. B 633, 739 (2006) 



[hep-ph/0512208]. 
[55] G. P. Korchemsky and A. V. Radyushkin, Nucl. Phys. B 

283, 342 (1987). 
[56] S. Moch, J. A. M. Vermaseren and A. Vogt, Nucl. Phys. 

B 688, 101 (2004) [hep-ph/0403192]. 
[57] G. P. Korchemsky and G. Marchesini, Nucl. Phys. B 406, 

225 (1993) [hep-ph/9210281]. 
[58] E. Gardi, JHEP 0502, 053 (2005) [hep-ph/0501257]. 
[59] A. Jain, I. Scimemi and I. W. Stewart, Phys. Rev. D 77, 

094008 (2008) [arXiv:0801.0743]. 
[60] G. Martinelli, M. Neubert and C. T. Sachrajda, Nucl. 

Phys. B 461, 238 (1996) [hep-ph/9504217]. 
[61] M. Neubert, Phys. Lett. B 393, 110 (1997) [hep- 

ph/9610471]. 

[62] M. E. Luke and A. V. Manohar, Phys. Lett. B 286, 348 
(1992) [hep-ph/9205228]. 

[63] M. Beneke, V. M. Braun and V. I. Zakharov, Phys. Rev. 
Lett. 73, 3058 (1994) [hep-ph/9405304]; M. E. Luke, 
A. V. Manohar and M. J. Savage, Phys. Rev. D 51, 4924 
(1995) [hep-ph/9407407]; M. Neubert and C. T. Sachra- 
jda, Nucl. Phys. B 438, 235 (1995) [hep-ph/9407394]. 

[64] I. I. Y. Bigi, M. A. Shifman, N. G. Uraltsev and 
A. I. Vainshtein, Phys. Rev. D 52, 196 (1995) [hep- 
ph/9405410]. 



